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Preface 


This book is intended to be a realistic first course in quantum mechanics 
and as such no claim is made that the subject is covered comprehensively. 
However, it is hoped that the material included will not only give a good 
basic grounding in the subject but will introduce the student to the more 
advanced techniques essential for a serious study. , 

After a brief survey of classical mechanics and of the historical back- 
ground of Quantum Theory, a postulate statement is given for Quantum 
Mechanics. One chapter is devoted to matrix representation and two 
chapters to approximate methods of solution. Group theory is introduced 
in chapter nine. This is a topic of growing importance. The final chapter 
is an introduction to Dirac’s relativistic theory. 

This book should be useful to first degree students in physics, 
chemistry and mathematics and also to research workers in industry, 
polytechnics and universities. 

Problems are included at the end of each chapter and a few of these 
have been used to extend the theory slightly. SI units are used and the 
charge on the electron is taken as —e = —1-602 x 107!° C. A list of 
references and further reading is given at the end of each chapter. 

I would like to thank my first teacher of the subject Professor H. C. 
Bolton whose influence I gratefully acknowledge. 


Tynemouth, H.C. 
June 1973, 


CHAPTER 1 
Classical Dynamics 


1.1. Introduction 


Although classical mechanics does not apply to the dynamics of atomic 
systems it certainly affords an excellent approximation to a wide range 
of phenomena. Quantum mechanics is a more general theory which con- 
tains classical mechanics as a limiting case and in fact historically quan- 
tum mechanics was developed by analogy with classical theory although 
this approach is not followed in this text. 

It is important that the reader understands something of the more 
general formulations of classical mechanics and is familiar with the terms 
Lagrangian, conjugate momentum, Hamiltonian and Poisson bracket. A 
brief review is given in this chapter but for a more complete treatment 
the reader is referred to Principles of Mechanics, by Synge and Griffith, 
or Classical Mechanics, by Goldstein. 


1.2 The Lagrange Equations 


The configuration of a mechanical system is determined when the values 
of a set of ‘generalized co-ordinates’ are given. The set of co-ordinates 
chosen may be determined by the symmetry of the system. Some or all 
of these generalized co-ordinates can be arbitrarily varied independently 
without violating the constraints of the system and the number of such 
co-ordinates is called the number of degrees of freedom of the system. 
In a holonomic system the number of degrees of freedom is equal to the 
number of co-ordinates. A smooth sphere in contact with a fixed plane 
constitutes a holonomic system, five co-ordinates being necessary to 
define the configuration, each of which may be varied independently. 
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On the other hand a rough sphere rolling on a fixed plan forms a non- 
holonomic system as there are now only three degrees of freedom whereas 
the number of co-ordinates is still five. The rest of this chapter is con- 
cerned with holonomic systems only. 


0 x 


Figure 1.1 A smooth sphere in contact with a fixed plan. The mass centre is 
specified by two co-ordinates (x, y). These angles (Eulerian), specify the orientation 
of the sphere. There are five degrees of freedom. 


Lagrange’s equation can be derived from Newton’s laws of motion. 
Consider a system composed of N particles of masses mj; (i= 1,...,.N). 
At time ¢ the position vector r; of the particle is determined by the set 
of n generalized co-ordinates qj (j=1,...,m), ie. 


t=; (1,---9n t) (1.1) 


(for N free point particles n = 34). If it is assumed that the system has 
fixed constraints (i.e. scleronomic), then r, does not explicitly involve 
the time. 

The velocity of the particle is then 


. dy war dg 
oH = > Me (1.2) 


The derivatives q; are called the generalized velocities of the system. 
The particle-velocity i; is a function of all the variables q;, gj and from (1.2) 


ai, _ 2m i 
and 
dt qj; 0qj; 


Suppose the particles each undergo a small displacement consistent 
with the constraints of the system such that all the forces remain con- 
stant during the displacement. In this virtual displacement the total work 
done is 


8W => F,; .51; (1.5) 
i 
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where F; is the force on the ith particle and 6r; is its displacement. From 
Newton’s laws of motion 


F; = mit; Ql .6) 
and so 
bW= Dm; . 5r;. (1.7) 
i 


If it is assumed that the internal forces do not work then F; may be 
identified with the externally applied force. 
Since 


or; 
81> > ay, Oa 
a] 


then from 1.3 and 1.7 


Or; 
sw=3(5 mii; —* \ 8q;. (1.8) 
i\i 0G; 
The total kinetic energy of the system is 
T=4 2 mit? (1.9) 
or; 
aT 23m 7 (1.10) 


and 


a (er _ dh _ d 
= (=) => m,(#,.—2) +m, li,.— —*)}. ; 
rf (37) 2m ( a 2m; (i 7 2 | (1.11) 


aT ae 
g (32) S mii; - eo (1.12) 
F tae 


d oT oT 
bW= (oz - 2 )say (1.13) 
, 
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Generalized forces Q; may be introduced so that (c.f. (1.5)) 
5W= 20,54; (1.14) 
j 
and these generalized forces are given in terms of the applied external 
forces. 


As the system is holonomic it is possible to consider a displacement 
involving one degree of freedom only, e.g. 


6q; #0, 5q; =0 TFL 
From (1.13) and (1.14) 


ag Ae a Oe i=1,...,m. (1.15) 


Only conservative forces will be considered and they can be derived 
from a potential function V(q,, . . ., dn) where 5W =—5V, ie. 


—5V = 20,54; 
i 
and so 
aV 
=_—: (1.16) 
Q; 0q; 
Equation (1.15) becomes 
d oT oT OV 
— (1.17) 


As the potential function does not depend on the generalized velocities 
it is possible to define a Lagrangian 


L@, --94n> q1> rr) Gn) = TQ, 69 Ons a1, ery Qn) 


—V(q1.---54n) (1.18) 
such that 
d aL ab 
——-—-_— =-0 f=1,..., ° 1.19 
dt 0g; 9q; ; # ( ) 


This final result is Lagrange’s equations for a conservative holonomic 
system. 

As an almost trivial example consider the motion of a compound 
pendulum. A uniform rod of a length 2a and mass m is free to rotate 
about a horizontal axis through one end. This is a holonomic system 
with one generalized co-ordinate, the angle @ (see Fig. 1.2). 
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Figure 1.2 Compound pendulum. 


The kinetic energy of the system is T= 416? where J is the moment of 
inertia about the pivot, i.e. 


T =4ma*6?. 


With a suitable choice for zero the potential energy is V = —mga cos 8. 
The Lagrangian function is then 


L= 3ma6? + mga cos @ 
and Lagrange’s equation becomes 
dab +g sin 0 =0. 


For small oscillations the motion is simple harmonic with frequency 


1.3. Hamilton’s Equations 
To derive Hamilton’s equations it is necessary to generalize the concept 


of momentum. The Lagrangian of a point particle moving in a conserva- 
tive field is 


= Ge? +3? +37) — VGs », 2). 
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The components of linear momentum in the x, y and z directions are 


= mx = oE ei Ob, = , 0h 
Px ax? Py iy ap’ Pz = mz a5 


Accordingly, when the Lagrangian is expressed in terms of a set of 
generalized co-ordinates g;(j = 1, . . .,2) the n components of generalized 
momenta may be defined by 


Pir ms : ad .20) 


p; is the momentum conjugate to q;. 
For the compound pendulum discussed above 


Pe = 4ma6 = 16. 
A Hamiltonian function is defined by 
H= 2 pidy — L. (1.21) 


From (1.20) the components of momentum are expressed in terms of the 
q’s and q’s. This set of equations can be solved.for the q’s in terms of the 
q’s and p’s and in this way the Hamiltonian is expressed as a function of 
the q’s and p’s, 


H=H(Q),-.-sQn>Pis+++sDn)- (1.22) 


It is clear from (1.2) and (1.9) that the kinetic energy is a homogeneous 
quadratic function of the q’s and so from Euler’s theorem 


oT 
24 x 227. (1.23) 
7 | 04; 


From (1.18) and (1.20) p; = 97/0; and so for a conservative field (1.21) 
becomes 


H=2T-(T-V) 
=T+V. (1.24) 


The Hamiltonian is equal to the sum of the potential and kinetic 


energies. 
To obtain Hamilton’s equations consider a small change in the 
generalized co-ordinates. In terms of differentials 


OL 
dH = > dp;q; + ; dg; -— > — dq; - =~ dq;. 1.25 
2 PiQi 2. Pi qi 2a dq; a ag; qi ( ) 
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By definition of the generalized momenta p; = dL/dq; and from 
Lagrange’s equations (1.19) 


OL 


dH = 24: dp; — 2Pi dq;. 


As 
H =H, «+9 OnsPis- + Pn) 
then 
oH oH 
—=-p; and — = qj. (1.26) 
qi : ap 


These final relations are Hamilton’s equations. (Canonical equations.) 

Hamilton’s equations form a set of 2n first order equations in the 2n 
variables q; and p; and the co-ordinates and momenta enter the theory 
on equal basis. The complete solution of Hamilton’s equations involves 
2n arbitrary constants. If the values of all the co-ordinates and momenta 
are known at some instant then the future state of the system is uniquely 
determined by Hamilton’s equations. 

As a simple illustration consider a point mass moving under gravity. 
The kinetic energy is 


T= (a? +9? +27) 


and the potential energy is V = mgz where z is the perpendicular height 
from some chosen zero and g is the acceleration due to gravity. 
The Lagrangian is 


LF +? +27) — mgz 


and the momenta conjugate to x, y, z are 
ab 


Tae =mx, Py =my, Dz =z. 


Px 


The Hamiltonian is 


H=Tt+vV 


1 
= 5, (pi + Py + pz) + mgz. 
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Note that the Lagrangian is used to relate g; and p; and hence to obtain 
the Hamiltonian. 

A more interesting example is afforded by the rigid rotator. This 
system is composed of two point masses 7, and mm, a fixed distance 


x 


Figure 1.3 Rigid rotator. C is mass centre. 


apart, rotating about the mass centre C, and arises in discussing the 
energy levels of diatomic molecules. Let C coincide with the origin of 
the co-ordinate system and let 711, m2 be at the distances r,,72 respect- 
ively from C. This is a holonomic system with two degrees of freedom, 
the polar angles @ and ¢. The potential energy of the system is zero and 
the kinetic energy is 


T=4m,Vi +4mV3 


where V;, V2 are the speeds of m,, mz respectively. In terms of the 
polar angles 


T =4m,[r262 +r? sin? 062] + 4m, [r362 +73 sin? 047). 
The moment of inertia about the mass centre is 


2 2 
1 +MoPr2 


l= mir 
and the Lagrangian is 


L=T=41(6? +sin? 662). 
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The momenta conjugate to @ and ¢ are 


Ly Mi eg 
Po = 3g = 18, Po= 35 =I sin’ 0¢ 


and the Hamiltonian of the system is 


1 
H(, $, Pe, Po) = T= (3 + 


1 2 
sin? 9 7} 
Conservation of energy 


The total energy of the system is 


E=T+V=H(Q,.-4nsP1>--->Pn) 


and the rate of change of the energy is 


dE (a+ Ma) 
—- = — = x Pi 
dt ; aq; f ap; é 


-y (22 24 2H 2) 
7 \ 94; OP; 9D; 84; 
=0. (1.27) 


Throughout the dynamical motion the energy remains constant. This 
result of course was to be expected as only conservative forces have 
been considered (1.16). 


Poisson brackets 


Suppose F and G are two functions of the 2” variables q;, p;. The Poisson 
bracket of F and G, denoted {F, G} is defined by 


{F, G}=>. (= ee 221, (1.28) 
j i OP; j OF 
It is immediately obvious that 
{F, G}=—{G, F} 
and 
{F, F} =0. (1.29) 
Simple important cases arise when F and G are q; and p;. 


{qi,4;} =0, {Pi pj } = 9, {qi, Dj} = i, (1.30) 
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where 6,; is the Kronecker delta defined by 
5;; =0 i#j 
=1 i#j. 
In classical mechanics Poisson brackets occur when discussing the 


rate of change of functions of the 2n variables q; and p;. F is such a 
function and 


fy (Hy +%5)->(F #- EM 
dt & \dq; 7 ap") <\8qy ap; ap; 2a; 


oF oie Hh: (1.31) 


If {F, H} =0 then Fis a constant of the motion. 
There is a correspondence between Poisson brackets and the com- 
mutators of quantum mechanics. 


1.4 Time Involved Explicitly 


In the more general case it may be possible to describe a system by a 
Lagrangian satisfying (1.19) but which explicitly involves the time. The 
conjugate momenta are defined in the usual manner (1.20) and so a 
time-dependent Hamiltonian may be found (1.21). Hamilton’s equations 
(1.26) still follow from the analogue of (1.25) but in addition there is the 
result 


Oia Ue, (1.32) 

ot or 
The appropriate generalization of (1.27) is 

dH 0H 

—_—=—. 1.33 

dt dt (22) 
If F explicitly involves the time then (1.31) becomes 

dF OF 

— ={F, AH} +=. 34 

dt { } ot oe) 
A system is conservative if either of the equivalent statements 

aL oH 

—=0, a2 =0 1.35 

ot ot O23) 


is satisfied. 
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An electron in an electromagnetic field 


For some systems it is not always obvious what the precise form of the 
Lagrangian should be. Consider an electron with charge —e in an electro- 
magnetic field with electric vector & and magnetic induction vector B 
(S.I. units). The force exerted on the electron moving with velocity r is 


=—e[@+r AB}. (1.36) 


The field may be described by the scalar and vector potentials $ and A 
with 


dA 
B=curlA and & = —grad o-a: (1.37) 


In general these potentials are functions of position and time. These 
equations do not specify ¢ and A uniquely and it is convenient to 
impose the condition 


a 
div A + ue S% =0. (1.38) 


and e are the permeability and permittivity of the medium. The 
equation of motion of the electron (mass m) is then 


0A . 
mi=F =e grad ¢ +e —-— et Acurl A. (1.39) 


It will now be shown that the Lagrangian that describes the system is 
L(r, i,t) = 4mr? + eg — et. A. (1.40) 


The generalized co-ordinates will be taken to be the cartesian co- 
ordinates of the electron and so, 


—= gS. 1.41 

ax; ax; 2 Ox, ie 
Also 

ae 

ax; = Mx; — eA;. ad 42) 


Lagrange’s equations (1.19) become 


a 0A 0A 
mee Me 2At of, (M2 - ) 


0x4 0x2 
+x 0A3 0Ay 
3\ ax,  ax3 


and similarly for x2 and x3. These equations are simply the components 
of (1.39) so verifying that (1.40) is the correct Lagrangian. 
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The momentum conjugate to x; is from (1.42) 
Pi = mx; — eAj. (1.43) 


The conjugate momentum is not equal to the linear momentum unless 
A = 0, i.e. unless there is no magnetic field. The Hamiltonian is 


H = pqj-L=p.t-L 
j 
= (mi — eA).t—4mi’ — edtet.A 
=4m?F — ed. (1.44) 


In terms of conjugate momenta 


H(t, p, t) = = (p +eA)? — e¢. (1.45) 
It is a simple matter to confirm that 

aH ab 

at ot” 


If the fields are static then ¢ and A do not depend explicitly upon the 
time and consequently neither do LZ nor H and so the energy is conserved. 
In a purely magnetostatic field ¢ = 0 and the speed of the electron is a 
constant of the motion. 


1.5 Hamilton’s Principle 


The calculus of variations 


Let A and B be two fixed points in the xy-plane with co-ordinates 
(x1, ¥1), 2, ¥2) and let F be a functional form of the independent 
variable x, a function y(x) and the derivative y’ =dy/dx. For a given 
curve y = y(x) through A and B the integral 


B 
=| Fy, y')ax (1.46) 
A 


has a definite value. This integral will in general take different values for 

different curves through the fixed points A and B. It is desired to find 

the curve y = ys(x) which makes / take a stationary value, if this exists. 
Let g(x) be an essentially arbitrary differentiable function such that 


g(x1)=g(x2)= 0. (1.47) 


Then y(x) =y,(x) + g(x) where £ is small is the equation of a neigh- 
bouring curve through the points A and B. 
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Figure 1.4 


The integral (1.46) may be regarded as a differentiable function of 8. 
B 


1(8) =| FG, vse) + Bae), val) + Be'(x)) dx. 
A 


Since this integral is stationary when 6 = 0 then /'(0) = 0. Differentiating 
under the integral sign gives 


B 
[ BOF, G vo ys) +8'@)Fy' Yo ¥s)] dx = 0. (1.48) 
A 


The second term can be integrated by parts and using (1.47) equation 
(1.48) becomes 


B 
d 
{ g(x) Ee Ys Vs) — ay fr" =) dx =0. 
A 
Since g(x) is arbitrary, the condition for / to take a stationary value is 


oF da {oF 
OE ae ae, 1.49 
ay dx (5) ’ ox 


This differential equation for y is called the Euler-Lagrange equation. 
The extension of the theory to include n variables y;(j = 1,...,7) is 
straightforward and the condition that the integral 


B 
T= | FQ Y1,-- Yas Yio Yn) ax (1.50) 
A 
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has a stationary value with respect to variations in the y’s is that 


i 
iy ax ay; j=1,...,n. (1.51) 


If the identification 
xt 


vier "> 
t 


VYirg 


is made, then the above theory shows that the Lagrange equations 


a (ab) ax, 
are equivalent to choosing the trajectories of the co-ordinates q;(t) so 
that 


B 
Bf LG du. nsGs- +n) dt =O. (1.52) 
A 


A represents the fixed initial configuration of the system at time ¢,, and 
B the fixed final configuration at f,. This is Hamilton’s principle. Of 
course for conservative systems the Lagrangian does not depend explicitly 
on the time and the variational principle determining the equations of 
motion (1.19) is 
B 
5 { (7- V) dt =0. (1.53) 
A 
Generally the stationary value of the integral is a minimum. 


1.6 The Hamilton-Jacobi Equation 


The Hamilton-Jacobi equation is of special interest since the quantum 
mechanical Schrodinger equation includes the classical Hamilton-Jacobi 
equation as a limiting case. 
Define the action integral 
B 
S= [L(G q1. +1 dne Gis ++ dn) at (1.54) 
A 
Suppose the integral (1.54) is evaluated along the path in the (» + 1) 
dimensional space defined by q1,. . ., dn, t representing the natural 
motion of the system from a point A to a point B. This is the unique 
stationary path and the value of S must only depend upon the values of 
the co-ordinates and time at A and B. 
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Consider now a neighbouring stationary path from the same initial 
point A to a new end point B,. This is possible since the velocity com- 
ponents have not been defined at A. Let dqig,i=1,....7 and 5tp be the 
displacement of B, from B. The change in S can be expressed in terms 
of these increments. Only an outline of the argument is given. (For 
further details see the references at the end of the chapter.) 

With the aid of (1.21), the integral (1.54) becomes 


B 
s=| (Sp. da; — Hrs. »dmdi- dm) 2). (1:55) 
A t 


After some manipulation, the variation in S is 
8S = > pipdqin — HpStp 
t 


B 


+ j (= Sp; dq; — >.5q; dp; — 5H dt + 6t an). (1.56) 
A t 


Note that there is no reason to take 55 = 0 (c.f. equation 1.52) since 5S 
is the variation in S between two stationary paths. The variation in H is 
given by 


oH oH 0H 
a= 2, (# 8qi +2 5p, se Pa (1.57) 


and so 


5S => pinddin — Hpdtp 
t 


B 
+f [2|(@ 7 a 5pi — (2 2H) ss | + (a2) sr 
A t 
(1.58) 


As the integral is taken along a stationary path representing a natural 
motion of the system, Hamilton’s equations (1.26) are satisfied (also 
(1.33)) and so the integral is zero. 


5S => pip — Hpdtp. (1.59) 
i 
Replace q;p and tg by q; and t to represent any point on the path. 
As the increments are independent 
S=S@1,.-+4nst) (1.60) 
with 
os 


dg, 
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and 
as 
ap G++ Ine Piss + Prot). (1.61) 
Combining these two equations 
as as as 
tA, -- dn D+ + Tot] 0. : 
at (« qn aq, ie 0 (1.62) 


This partial differential equation satisfied by S is the Hamilton-Jacobi 
equation. For a conservative system the Hamiltonian does not explicitly 
involve the time and the equation reduces to 


ij as 5) 8 ve 
A endnr e reee an ar ( . ) 


where £ is the energy of the system. 
For a rigid rotator with moment of inertia J the Hamiltonian is 


and the Hamilton-Jacobi equation is 


zs as\ | 1 (asp) _ as 
21\\ae} — sin? eae Ot. 


The solution of a dynamical problem can be reduced to the solution 
of the Hamilton-Jacobi equation but this will not be pursued here. The 
interesting relationships between the Hamilton-Jacobi equation and the 
Schrédinger equation of quantum mechanics is mentioned in chapter 
four. Schrédinger used this equation in the development of wave mech- 
anics. By comparing Hamilton’s principle with Fermat’s principle an 
analogy is obtained relating particle trajectories to rays and Hamilton’s 
function S to the phase of a wave. 


PROBLEMS 


1 A particle mass of m distance r from the origin, is moving in a plane 
under an attractive force m/r? directed towards the origin. Write down 
the kinetic and potential energies and verify that Lagrange’s equations 
may be written . 


F 72, um d 24 
— mro? += =0, — (76) = 0. 
mF — mr 2 mae ) 


2 A double pendulum consists of two uniform rods AB and BC each of 
mass m and length / hinged together at B. The end A is hinged at a fixed 
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point O so that the rods can oscillate in a vertical plane. The configuration 
of the system is specified by the angles @ and ¢ between the rods AB, BC 
and the vertical. Write down the kinetic and potential energies and 

obtain the two Lagrange equations. Assuming the oscillations are small 
find the solutions of Lagrange’s equations of the form 


6g =Aje" $= Aze!! 


where A,, A are constants. Confirm that the two normal frequencies 


are 
V8 »_ v2 
2 = 


9 VO 
Or et) and 1/2 +1) 


where g is the acceleration due to gravity. 
3 Consider a system which moves according to equation (1.15) 
d / oT oT 
s Pas boa ae 
dt \0q; } 9a; 
Integrate this equation with respect to time from t = 0 to f = r. Proceed 


to the limit 7 > 0 and assume that the generalized force Q; > °° in such 
a manner that the impulsive force 


; 
I,= lim | Qae 
T>0 6 
is finite. Using L = T— V show that 


OL 
1, =6 i = 6pj. 
0qi 


That is, the change in the generalized momentum is equal to the 
generalized impulsive force. 


4 A particle of mass m undergoes simple harmonic oscillations along 
the x-axis. The controlling force is directed towards the origin and has 
value —kx. Write down the kinetic and potential energies and hence the 
Lagrangian. Find the momentum conjugate to x and show that the 
Hamiltonian of the system is 


1 1 
H=— 24 2. 
2m? as 


5 Atwo dimensional harmonic oscillator has kinetic energy T and 
potential energy V given by 


=7@ +) va ei +43). 
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Derive Lagrange’s equations of motion and find the conjugate momenta 
P1; Pz. Show that the Hamiltonian of the system is 


J 
H(p, Dera [pi + pz + m?w(qi + q3)). 


New co-ordinates Q,, Q2 with momenta P,, Pz are defined by the 
transformation 


[2 : [2 : 
qa mo V1 sin 1 q2 = Tro V2 Sin Or 


Py =+/2mw/P, cos OQ; P2 =v/2mwr/P2 cos Q2. 
Confirm that the transformed Hamiltonian is 
H'=w(P, +P). 


A necessary condition for this transformation to be canonical is that 
Hamilton’s equations (1.26) are satisfied for this new Hamiltonian. 
Show that these equations are 


P,= 0 and 0; =W i= 1, 2 
and verify that they are indeed satisfied. 


6 Prove the following properties of Poisson brackets 

(i) {E+ F, G}={£,G}+ {FG} 

(ii) {F, {G, K}} + {G, {K, F}} +{K, {Fh G}} =0 
where £, F, G, K are all functions of q; and p;. 
7 Suppose A, B are two fixed points in space, A higher than B and 
choose the two co-ordinate axes so that both points lie in the (x, y) 
plane. Let the co-ordinates of A and B be (;, ¥1), (x2, 2) respectively. 
y =y(x) is the equation of a curve joining A and B. A smooth particle 


starts from rest at A and falls under gravity down the curve. 
Show that the time taken to reach B is 


ely ,_dy 
#! Gi where ae 
Show that T is a minimum when the curve joining A and B satisfies 
(1 +y’*)(y, — y) = constant. 
Introduce the parameter y with y’ = tan y and derive the solution 
y—y, =ACI t+cos 2p) 
x=B- A(2y +sin 2p). 
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These are the parametric equations of a cycloid. (This is the famous 
brachistochrone problem). 


8 The Hamiltonian for a particle of mass m moving vertically under 
gravity is 


1 
H(x, p) awe + mgx 


where x is the height of the particle above the zero for the potential 
energy. 
Obtain the Hamiltonian-Jacobi equation. Verify that 


2 2 
S=-a,t+ fee — mgx)*? +a, 
m 3g 


is a solution where a, and a are constants. Using equation (1.61) 
explain the physical significance of the constant a, and find the 
momentum as a function of x. 
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CHAPTER 2 


Origins of the Quantum 


Theory 


2.1 Failure of Classical Mechanics 


The classical mechanics formulated by Newton and developed by 
Lagrange, Hamilton and others, is very successful in explaining the 
dynamical motion of macroscopic objects and before the beginning of 
this century was universally accepted as being the ultimate theory of 
dynamics. 

A similarly satisfactory state of affairs existed concerning the theory 
of light. The work of the Frenchman A. J. Fresnel had led to the accept- 
ance of the wave nature of light. Later the Scotsman Clerk Maxwell 
developed the electromagnetic theory of light and in 1887 the German 
physicist Hertz confirmed experimentally that electromagnetic waves 
are emitted by oscillatory electric charges. 

However, by the end of the nineteenth century these classical theories 
were unable adequately to explain a growing number of small-scale 
(atomic) phenomena. Physicists were beginning to explore the structure 
of matter and the nature of radiation and the interaction of radiation 
with matter. As a consequence, the foundations of physics were radically 
re-examined and the first quarter of the present century was to see the 
development of the theories of relativity and quantum mechanics. This 
chapter attempts to trace the history of the more obvious failures of 
classical mechanics and how they were overcome, firstly by Planck, 
Einstein and Bohr, and later by Schrédinger and others. 
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Black-body radiation 


An obvious break-down in classical theory occurred in the attempts to 
account for the continuous spectrum emitted by a black-body. A black- 
body is defined as one that absorbs all incident electromagnetic radiation. 


\ ———-— Rayleigh-Jeans 


Experimental 


E(A) 


0 A 


Figure 2.1 Radiation density from a black-body radiator. 


It follows from thermodynamics that such a body is a better radiator at 
every frequency that any other body at the same temperature. The 
frequency distribution of the spectrum depends only on the temperature 
and not on the material of the black-body. A true black-body does not 
exist but it is clear that the radiation emitted from a small opening of a 
hollow enclosure is equivalent to black-body radiation. 

In 1879 Stefan suggested that the rate of heat loss from a hot body is 
proportional to 7+ where T is the absolute temperature and Boltzmann, 
using classical thermodynamics was able to derive the result 


where £ is the total radiation density from all frequencies and o is a 
constant; c is the velocity of light. Wien in 1893 also used a classical argu- 
ment to show that the wave-length, which is emitted with maximum 
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intensity from a black-body, is proportional to 1/T. This is the Wien 
displacement law and is in agreement with experiment. 

So far classical theory was adequate. However, there was no theory 
to explain the observed frequency distribution of the black-body 
radiator (Fig. 2.1). Wien proposed a semi-empirical theory which agreed 
in the short-wave limit while Lord Rayleigh followed by James Jeans 
gave a theoretical result based on classical reasoning which agreed at long 
wave-lengths but was in complete disagreement for short wave-lengths. 
The essentials of their argument are given below. 

The diffraction experiments of Young in 1803 and the Maxwell theory 
of electromagnetic radiation had led to the acceptance of the wave nature 
of light. Rayleigh and Jeans used wave theory together with the classical 
law of equipartition of energy to obtain the Rayleigh-Jeans formula. 
Following Hertz, the emission and absorption of radiation was taken to 
be due to oscillatory charges within the black-body. It was assumed that 
there are a great number of linear harmonic oscillators with all possible 
frequencies in equilibrium at the temperature in question. 

Consider such a system. Let E(v)5v be the density of radiation energy 
per unit volume with frequencies between vp and p + 5p. It can be shown 
(Classical Electricity and Magnetism, p. 336 by Panofsky and Phillips) 
using classical electromagnetic theory that when the oscillators are in 
equilibrium with the radiation in a black-body cavity 

8 Z 
Ev) > u(v) (2.1) 
where u(v) is the average energy of the oscillators with frequency v and 
c is the velocity of light. 

The classical ‘Law of Equipartition of Energy’ states that the mean 

energy of the oscillators with frequency v is 


u(v) =kT (2.2) 


where k is Boltzmann’s constant and T is the absolute temperature. From 
(2.1) and (2.2) 


8 2 
E(v)$v = kT 8p 
c ‘ 
or 
81 
EQN = 5G RT BA. (2.3) 


This is the Rayleigh-Jeans law and it implies that the energy radiated 
in a given wave-length range 5A increases without bound as A becomes 
smaller. This non-physical result is of course experimentally incorrect 
(Fig. 2.1) and the Rayleigh-Jeans law is in agreement with experiment 
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only for long wave-lengths. Also, equation (2.3) implies that the total 
energy emission is 


E= { Eo) w= Ser |v? dp (2.4) 
0 ¢ (4) 
and clearly £ is infinite. The Rayleigh-Jeans law demands that the total 
energy radiated per unit time is infinite at all temperatures except T= 0. 
This obviously false result is often referred to as the ultra-violet 
catastrophe. 

Clearly there was a fundamental error in classical theory. 


2.2 Origin of the Quantum Theory 


Max Planck 


In 1900, before the publication of the Rayleigh-Jeans law, the German 
physicist Max Planck discovered an empirical relation that agreed with 
the entire continuous spectrum emitted by a black-body radiator. He 
then worked to find a physical explanation for his formula and succeeded 
in doing this after several weeks work. His results, published in 1901, were 
the origin of the quantum theory. The first quarter of the present cen- 
tury saw the subsequent development of the ‘Old Quantum Theory’ 
which was to be superceded in 1925 by the ‘wave mechanics’. 

Planck also assumed that a black-body is composed of oscillators in 
equilibrium with the radiation field and that equation (2.1) holds true. 
He realized that some drastic change was necessary to obtain a fit to 
the experimental data and his basic assumption was that the material 
oscillators can only have discrete energy levels rather than a continuous 
range of energies that had been assumed before. In particular Planck 
assumed that an oscillator with frequency v can only take the values 


€n =nhv n=0,1,2,... (2.5) 


where h is a universal constant, later to be known as the Planck constant. 
Planck chose the value h = 6:55 x 10-34 joule sec (units of action) to 
agree with the experimental results but the value accepted today is 


h = 6-624 x 107 34 joule sec. (2.6) 


Note that the difference between consecutive energy levels of a Planck 
oscillator is an extremely small ‘quantum of energy’. 

Planck did not introduce the concept of the photon. He merely 
suggested that matter absorbed or emitted radiation energy in discrete 
amounts. (i.e. multiples of hv). 

The argument below is not in the form given by Planck who used a 
thermodynamic derivation. Consider the N, oscillators with frequency v. 
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In equilibrium at temperature T the number of oscillators with energy e,, 
is, from the classical Boltzmann expression, equal to 


Nyenen/kT S en enlkT (2.7) 
n=0 
The mean energy per oscillator of frequency v is 
u(v) = > €,e enlkT > e7enlkT (2.8) 
n=0 n=0 
Substitution of equation (2.5) gives 
u(v) = hv|(e””/*T — 1), (2.9) 


The Planck expression for the density of radiation is obtained from 
(2.1) and (2.9) to be , 


8nv? hv 
E(v)= a (GET 1) 1’ (2.10) 


Planck’s law (above) agrees very closely with the observed radiation 
distribution when h is chosen correctly (see (2.6)). In the long wave- 
length limit 


hv 
hv/kT ~ 1 +— 
7 kT 


and Planck’s expression reduces to the Rayleigh-Jeans. The total radia- 
tion density arising from all frequencies can be obtained using (2.10) 
and 


mk 
ch 


°° 8 4 
E= | EQ) diate Tt (2.11) 
0 


The energy density is proportional to the fourth power of the absolute 
temperature. This result was first suggested by Stefan. 


Albert Einstein—The photo-electric effect 


In 1905, Albert Einstein an employee of the Swiss patent office, took the 
next important step in the development of the quantum theory. Planck 
had suggested that oscillators absorb or emit radiation in energy bundles 
but that in free space the radiation obeys the continuous laws of Maxwell. 
Einstein pointed out that these ideas are not compatible and suggested 
that electromagnetic radiation only exists in discrete energy bundles later 
to be called photons. This was a step that Planck had hesitated to take as 
it opened the old wave-particle dialogue about the nature of light. The 
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first major success in the new theory was Einstein’s explanation of the 
photo-electric effect. 

In 1887, Hertz in his ‘hoop’ experiments confirmed the existence of 
the electromagnetic waves predicted by Maxwell. During the same 
experiment he noticed that when light flashes from the transmitting 
circuit shone on the ends of his detector ‘hoop’, the weak sparks in the 
gap passed more easily. The Hertz effect was explained by Lenard who 
showed that a negatively-charged metal loses its charge when exposed to 
ultra-violet light because the electrons gain energy from the radiation and 
are emitted. This is now known as the photo-electric effect. Further 
experimental investigation of this effect showed that the number of 
electrons emitted increases when the intensity of light is increased but 
that the maximum observed speed of the electrons only increases if the 
frequency of the light is increased. This final result cannot be explained 
by the classical Maxwell theory which clearly demands that the electron 
speed should depend upon the light intensity. Einstein gives the 
explanation in terms of photons. 

Einstein looked upon the photo-electric problem as one of particle 
collisions. He looked upon a monochromatic beam of light as being 
composed of ‘particles’ called photons, each of which has an energy hv 
where v is the frequency of the light. 


Electrons 


—————— TN 


Metal 
surface 


Light 


Figure 2.2 


Essentially, Einstein’s argument was as follows. When an incident 
photon collides with an electron in a metal surface, the energy of the 
photon is transferred to the electron and when the electron is emitted 
from the metal surface it has a kinetic energy given by 


4mv? =hv — P (2.12) 
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Q Yo Y 


Figure 2.3. Dependence of stopping potential on incident light frequency. 


where 4 mv? is the electron kinetic energy and P is the work required to 
extract the electron from the metal. P varies from metal to metal and is 
of the order of a few electron volts. 

The result (2.12) was verified by Millikan in 1916 in a series of 
experiments using sodium and potassium as the photo-electric surface. 
A stopping particle Vg may be defined by 


Voe = dmv. 
Vo is the applied potential required to prevent any emission of electrons 


and the experiment graph of eV against incident light frequency is 
linear as predicted by (2.12). The slope is equal to Planck’s constant. 


Specific heat of solids 


In 1906 Einstein applied quantum ideas to the theory of the specific 
heat of solids. 

Classical theory predicts that the heat capacity at constant volume is 
independent of temperature. Consider a solid composed of N identical 
atoms each of which can vibrate about its equilibrium position. The 
classical ‘Law of Equipartition of Energy’ demands that the mean energy 
of such a three-dimensional oscillator in equilibrium at absolute tempera- 
ture T is 3k7 and the mean energy of the solid is 


u = 3NKT. (2.13) 
For one mole, NV = 6-025 x 1073 (Avogadro’s number) and 
u=3RT (2.14) 
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where R is the gas constant. The lattice contribution to the molar heat 
capacity is 


a 
C= (24) = 3R ~ 6 cal/deg mole. (2.15) 
aT }y 


Classical theory predicts that the heat capacity per mole is the same for 
all substances at all temperatures. At room temperatures and above many 
solids obey this law but agreement fails entirely at low temperatures. 
Experimentally it is found that the heat capacity approaches zero as 
T>0. 

Einstein, following Planck, assumed a solid can be represented by a 
collection of harmonic oscillators which can only take discrete energy 
values nhv,n integer, where v is the oscillator frequency. Einstein con- 
sidered the simplest model and assumed all the oscillators have the same 
frequency Vp. 

The internal energy of the solid is then 


3hvo 
u=N (MoT — 1) (2.16) 
The lattice heat capacity at constant volume is 
au. [On\? __ ®°BT 
C= aT” 3Nk (*) [eoe/T “1p (2.17) 


where the Einstein temperature 0 is defined by hvg = k6,. The Einstein 
temperature for a material may be chosen to make equation (2.17) give 


c, —— Einstein 
—e— Experimental 


———— Classical 


Figure 2.4 Molar heat capacity. 
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a reasonable fit to the experimental results and is of the order of a few 
hundred degrees absolute. 

Einstein’s formula is in fair agreement with experiment for T> 67 
but at low temperatures when T <6 the agreement is poor. For non- 
metals, the heat capacity approaches zero as T3 whereas Einstein’s 
formula predicts 


/) 2 
ey -sme(*] e SEIT =—- TEGO x. (2.18) 


Debye (1912) successfully explained the 73 variation at low tempera- 
tures by considering a spectrum of frequencies to be present. 


2.3 The Hydrogen Atom 


Experimental spectroscopy developed rapidly during the second half of 
the nineteenth century. Because the emission of light was thought to be 
the result of vibrations, harmonic relations between the spectrum lines 
were looked for without success. However in 1885 Johann Balmer dis- 
covered that the wavelengths of all the spectrum lines of the hydrogen 
atom then known could be expressed by the formula 

see ee | 


b n=3,4,5,... 
where b is a constant. This equation is generally written 
kn _ 1 1 1 
—2=—=R{— _— 2.1 
2t On (3 =) G19) 
where k,, is the wave-number and R is the Rydberg constant with 
R= 1-097 x 107 m7“. 


The formula (2.19) gives excellent agreement with experimental results. 

Balmer and Ritz suggested that other spectral series might exist for 
the hydrogen atom with the first term different from 1/22. These have 
of course been found and the general expression for the wave-length of a 
hydrogen atom spectrum line is 


k 1 1 1 

a = > =R ( re | n >, each an integer. (2.20) 
Lyman found the ‘ultra-violet’ series with m = 1,n =2,3,4,...and 
Paschen discovered the ‘infra-red’ series with m = 3,n = 4, S56 5.6 0506 
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From (2.20), the wave number of any line of the hydrogen atom spectrum 
is the difference between members of the series of ‘terms’ 


th=—  n=1,2,... (2.21) 


The spectrum lines of other elements can also be obtained as the 
difference of two terms, i.e. 


However the terms ¢;, t; generally are more complicated than those used 
for the hydrogen spectrum. This is the Ritz Combination principle. 
Although these empirical formulae were very useful they had no theoretical 
explanation. 

In 1911 Ernest Rutherford suggested the ‘nuclear atom’. From his 
experiments on the scattering of a-particles Rutherford was lead to 
suggest that an atom consists of a small positively charged nucleus 
surrounded by outer electrons. (The electron had been established by 
J.J. Thomson in 1897.) 

Unfortunately classical deductions from Rutherford’s ‘nuclear atom’ 
do not explain the observed atomic spectra. No stationary configuration 
of charges can be in stable equilibrium under their own electrostatic 
forces (Earnshaw’s theorem) and so the electrons must be in motion 
relative to the nucleus. Such electrons must orbit in some way and the 
corresponding acceleration will result in energy loss by electromagnetic 
radiation. Hence electrons will gradually spiral into the nucleus. More- 
over the frequency of the radiation emitted will gradually increase as 
this spiralling continues and a continuous spectrum will be produced 
instead of the observed discrete spectral lines. 


Niels Bohr 


In 1913 the Danish physicist Niels Bohr combined the ‘nuclear atom’ 
concept of Rutherford with Planck’s quantum hypothesis to give a 
theory that accounted for the observed spectrum of hydrogen. Bohr 
made two basic postulates. 


(i) He assumed that the electron is a particle that can revolve about 
the nucleus only in certain allowed orbits which may be circular 
or elliptical. Each orbit represents a stationary energy state in 
which electromagnetic radiation is not emitted, in conflict with 
-the predictions of classical theory. 

The allowed orbits or states are those in which the angular 
momentum of the electron about the nucleus is an integral 
multiple of # =h/2t. 
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(ii) Emission or absorption of radiation occurs when the electron 
makes a quantum jump from one stationary state to another. 
When the atom makes a transition from a higher energy state 
E; to a lower energy state Ey Bohr suggested, (following Planck), 
that the energy difference is radiated as a photon of frequency 
pv with 


E,; — Ep = hv. (2.23) 
An outline of the Bohr theory is given below. 


Bohr used classical methods to calculate the energy of his allowed 
stationary states. The force of attraction between the hydrogen nucleus 
and the electron is 

2 
_ (2.24) 
4ne€or 


where r is the distance of the electron from the nucleus and €g is the 
permittivity of free space. The electron must move around the nucleus 
in an orbit that is a conic section. The simplest case is that of a circle 
and the inward acceleration is 


mv? 


=— (2.25) 
r 


where m is the electron mass and v is the speed. From Newton’s second 
law and equations (2.24) and (2.25) above, the kinetic energy is seen to be 


2 
(2.26) 


T ! mv 
=o v= - 
2 87€or 


The potential energy of the electron is given by 


4néor 


and so the total energy of the electron in a circular orbit of radius 7 is 
2 


E=T+V= (2.27) 


8néor 


The zero of energy is taken to be the ionized atom where 7 = =. 
From Bohr’s first postulate the radii of the allowed orbits are given by 


mor =nh n=1,2,3,.... (2.28) 
From (2.26) and (2.28) it is possible to eliminate v to give 
24,2 
Pe nh ance (2.29) 


me 
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and the energies of the allowed states are 
me* 1 


hoes SS 
4 2(4neo)*h? n? 


=1,2,... (2.30) 


When an electron jumps from the n; orbit to the nf orbit the 
frequency of the radiation emitted is given by 


poe 
h 

ie. 

ya —zmime* |] Ld ai 

(4m€9)*h? | n? on? | Gs) 

The wave number of the radiation is given by 

Khe 

Wm A C 


Kk 2n’me*® [to 1 

Qn (4m€)*hFc | nz ~ np yl 
This result agrees with the formula (2.20) if the Rydberg constant 

___2n?me* (2.32) 
(4n€9)"hc * ‘ 

The value of R calculated from (2.32) using the accepted values of m, e, 
c and h agrees excellently with the experimental value. This theory was 
a mixture of classical and quantum concepts. Bohr assumed that classical 
theory could be used to calculate the energies of the states but used his 
own quantum argument to determine the states allowed. 

Bohr’s theory dealt only with circular orbits. A. Sommerfeld (1916) 
(see also Wilson 1915), generalized the Bohr theory to include elliptic 
orbits. He said that the allowed states of a periodic system are determined 
by the quantum conditions 


§ pi dq; =n;h n; integer (2.33) 


where p; is the momentum conjugate to the co-ordinate q; and the 
integration is over a complete period of the motion. 

Consider an electron revolving about the hydrogen nucleus in a plane 
elliptic orbit, with the nucleus at one focus. The single Bohr condition 
(2.28) is replaced by the two conditions 


§ pg dg=ngh — py dr=n,h (2.34) 
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Figure 2.5 Elliptic electron orbit. 


where 1g, n, are both integers. The momentum Pg is simply the angular 
momentum and from Kepler’s law is a constant. 


Pg2n =Nngh or Po =Ngh 


as assumed by Bohr. ng is called the angular or azimuthal quantum 
number. n, is the radial quantum number and the principle quantum 
number is defined by 


n=n,+ng. (2.35) 


It can be shown that Bohr’s energy expression (2.30) holds for elliptic 
orbits where the 7 in the formula is now the principle quantum number. 
The introduction of elliptic orbits does not produce any new levels for 
the hydrogen atom although more than one state (orbit) may have the 
same energy. However, when the relativistic mass variation of the electron 
is included it is found that this ‘degeneracy’ is removed and that the 
energy depends upon both x and ng. Of course, a particle electron 
possesses three degrees of freedom and it is to be expected that a third 
quantum number should be necessary to define a state completely. This 

is the magnetic quantum number. 


Simple harmonic oscillator 


When the Sommerfeld rule is applied to the linear harmonic oscillator the 
energy levels used by Planck are obtained. Consider a particle of mass m 
oscillating about the origin in the x-direction. The energy F of the system 
is given by 

p? 

E =*— + 2n?v?mx? 

2m 
where p is the momentum conjugate to x and pv is the frequency. The 
Sommerfeld ‘action’ integral (2.33) becomes 


* lor 1/2 
$ p(x) dx =2m | a 4n?p?x? dx =nh n integer 


—a 
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where 


JE 
a= 2E han, 
m 


This integral can be evaluated by substituting 


RE 
x= /— sin 6/2av 
m 


to give 
E 
—=nh 
v 

ie. 
E=nhv 


This is equation (2.5). 


Bohr correspondence principle 


The Bohr theory of hydrogen atom even when developed using the 
Sommerfeld condition was clearly not complete. It did give an expla- 
nation of the Zeeman and Stark effects, but unfortunately the Bohr 
theory predicted too many spectrum lines. Some type of ‘selection 
rules’ had to be imposed. Also, although it was obvious that different 
spectral lines had different intensities the Bohr theory had no expla- 
nation of this. In 1918 Bohr added another rule to his theory which was 
to prove very successful in overcoming the difficulties mentioned above. 
This was the correspondence principle. 

In his original paper (1913) Bohr reasoned that in the larger atomic 
orbits, where the energy jumps are smoothed out, the quantum theory 
must give the same results as classical theory. Consider an electron jump 
from a hydrogen state with n; = N + 1 to a final state np = N. From (2.30) 
the frequency of the emitted radiation is 


he 2n?me* | 1 1 
(4ne9?h? |N2 (N +1)? | 
In the large quantum number limit N > © this frequency becomes 
4n2me* 1 (2.36) 
veo : 
(4ne9)°h? N? 


Now consider the frequency calculated from classical electromagnetic 
theory. Classically the emission frequency is equal to the frequency of 
revolution of the electron in the circular orbit. (Electrons in elliptic 
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orbits also emit frequencies which are integral multiples of the fundamen- 
tal frequency.) That is, the classical frequency is 


p=v/2ar 


where v is the electron velocity and 7 is the orbit radius. On substituting 
for v andr using (2.28) and (2.29) it is easily seen that the classical 
frequency is equal to the limiting quantum frequency (2.36). In the 
case where the energy change associated with a transition is small com- 
pared with the energies of the initial and final states, the quantum 
description approaches the classical description. 

In 1918 Bohr extended the original form of the correspondence 
princple and applied it to large energy jumps and managed to obtain 
rules for calculating intensities. Although it proved quite successful it 
was an unsatisfactory mixture of quantum and classical theory and will 
not be considered further. 


2.4 Failure of the Old Quantum Theory 


There were obvious weaknesses in the ‘old quantum theory’ of Bohr and 
Sommerfeld. It was seen from the beginning to be a mixture of quantum 
and classical ideas. The electron was regarded as a point charge and the 
Sommerfeld rules picked out the allowed orbits which had energies cal- 
culated using the classical theory. However, even though classical theory 
demanded that the accelerating electrons must emit radiation, Bohr 
postulated that this did not happen for electrons in a stationary orbit. 
Also, non-periodic phenomena were not included. There was no quantum 
theory of a-particle emission. 

Although the Bohr-Sommerfeld theory allowed only integral quan- 
tum numbers it was discovered that certain systems demanded half- 
integral quantum numbers. Such an example was the rotation spectra of 
the hydrogen halides molecules. The explanation of this requires a spin 
structure for the ‘particle electron’. In some cases quantitative disagree- 
ment occurred between the old quantum theory and experiment (e.g. the 
normal state of the helium atom) and in others even qualitative agreement 
was not even achieved. Disatisfaction with the old quantum theory even- 
tually lead to the discovery of the matrix mechanics of Heisenberg and 
the equivalent wave mechanics of Schrédinger (1926). 


2.5  Wave-Particle Dualism 


Dual nature of light 


By the end of the nineteenth century, the wave theory of light was 
firmly established on the basis of interference and diffraction experiments 
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and the success of the classical electromagnetic theory. This view was 
challenged by Einstein in his theory of the photoelectric effect in which 
energy is transferred to the electrons in quanta of energy hv called 
photons. Further evidence concerning the dual nature of light was given 
by the production of X-rays. 
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Roentgen discovered X-rays in 1895 and the wave nature of X-rays 
was established by von Laue in 1912 by his diffraction experiments. 
X-ray production may be regarded as the inverse of the photoelectric 
effect and is brought about by directing a stream of electrons onto the 
anti-cathode in an X-ray tube. If the X-rays produced are then used in a 
photo-electric experiment to produce electrons it is found that the maxi- 
mum energy of a photo-electron is equal to the energy of one of the 
electrons in the stream which originally produced the X-rays. This implies 
that the X-rays transport their energy across space in photon bundles 
without any dissipation. 

In 1922-23 Compton took the photon concept a step further when he 
associated a momentum with the photon in his explanation of the 
scattering of hard X-rays. 


Figure 2.6 


Compton effect 


The American physicist Arthur Compton discovered that when hard 
(short wave-length) X-rays are scattered by atoms of low atomic number 
(graphite) the scattered radiation includes not only the original wave- 
length but also softer X-rays of longer wave-length. This is contrary to 
the classical theory which demands that the scattering electrons vibrate 
with the frequency of the incident radiation and so emit radiation of 


36 A FIRST COURSE IN QUANTUM MECHANICS 


exactly this frequency. However Compton was able to explain this 
phenomenon by assuming that the X-rays consist of a collection of 
photon particles each having an energy hv and also a momentum hv/c 
where c is the velocity of light. 


Scattered photon 
Energy hy’ 


yoy 


Incident photon “\_/\_S\_S/™ ~~ 7---—-—----- 


Energy hy 


Recoil electron 


Figure 2.7 Compton scattering. 


Consider an incident photon of frequency v which collides with an 
electron and is then scattered. For materials of low atomic number the 
energy of the hard X-ray photon will be very large compared with the 
electron binding energy and so the electron may be considered to be 
free. During the collision, energy is conserved and so 


hv =hv'+T (2.37) 
where T is the kinetic energy of recoil of the electron and v’ is the 


frequency of the scattered photon. From special relativity theory 


1 
T=mo0?| == ~-1 (2.38) 


v 
1-(2) 
c 
where mp is the electron rest mass and v is the speed of recoil. Applying 
the low of conservation of momentum 


hu m 
—=— cos¢+——— _ 0 c080 


and 


uo = v sin 0. (2.39) 
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There are three equations involving the three unknowns p’,v and 6 and 
on solving it is found that the increase in wave-length is 
2h .2¢ 


6X =A’ —A= — sin 


2.40) . 
MgC 2 ( ) 


where and X’ are the wave-lengths of the incident and scattered X-rays 
respectively. Note that 6A is constant for a given scattering angle ¢ and 

so the percentage increase in wave-length is greater for small wave-lengths. 
This formula is in quite good agreement with experiment. 

When the frequency of the incident radiation is reduced, the binding 
energy of the electron cannot be ignored and instead of Compton scat- 
tering the photoelectric effect takes place or the X-ray undergoes Laue 
scattering. 

The work of Einstein and Compton when taken in conjugation with 
the work on interference and diffraction leads to the paradoxical result 
that electromagnetic radiation has a dual nature. Sometimes it behaves 
with wave-like properties and sometimes with particle-like properties. 
The attempts made to understand this paradox have resulted in the new 
quantum theory. 


Dual nature of matter 


In 1897 the English physicist J. J. Thomson succeeded in deflecting 
cathode rays by an electrostatic field and proved conclusively that the 
rays consist of material particles with a negative charge. For the next 
quarter century no doubt was expressed as to the particle nature of the 
electron. 

During 1922-23 the Frenchman Louis de Broglie suggested that not 
only does radiation have a dual nature but material particles also require 
a wave-particle description. He was primarily interested in the nature of 
light and on the basis of Einstein’s relativity theory he was convinced of 
the dual wave-particle nature of light. 

From special relatively, the relation between the total energy F of a 
particle in field free space and its momentum p and rest mass mg is 


2 
(=) oer) 2.41) 
Taking the rest mass of a photon as zero 


2 
(=) = p* or E = pe. 
c 


This final result is consistent with the Einstein formula 
E=hv 
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and the Compton result 


poh. 
In relativity both radiation and matter appear as forms of energy. L. de 
Boglie suggested that any moving particle has associated with it a group 
of waves whose wave-length A depends on the momentum of the particle 
according to the equation 


peh/n. (2.42) 


This is the equation used by Compton for photons. The wave-length 
associated with large bodies (e.g. a billiard ball) moving at low speeds is 
far too small to be detected experimentally. However, the wave-length 
becomes appreciable for atomic particles. An electron with a 10 eV 
kinetic energy has a wave-length of 5:3 x 10~° cm (5 A). 

De Broglie noticed correspondences between the classical theory of 
light and the classical theory of mechanics which help to suggest equation 
(2.42). The French mathematician Fermat had reduced the laws of geo- 
metrical optics to the principles of ‘least-time’. That is, a light ray follows 
the path requiring the least time. The time taken for a light ray to travel 
from a point A to a point B is given by the line integral 


r 1 
——— ds 

| Vy, ¥, Z) 

A 


where V,, is the phase velocity in the medium and is a function of 
position in general. As 


where p is the constant frequency, Fermat’s principle may be written 


1 
———— ds =0. 2.43 
A, y, Zz) . ¢ ) 
A 


There is a corresponding principle in classical mechanics. This is 
Hamilton’s principle (1.52) which states 


B 
8 { Lar=o. 
A 


For a single particle of constant total energy E and kinetic energy T then 


L=2T-E as E=Ttv 
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and so 
B 
8 { 27 ar=0. 
A 


By changing the variable of integration from the time to distance it is not 
difficult to show that this final result can be written 


B 
5 } p(x, y, z) ds =0 (2.44) 
A 
and the similarity between (2.43) and (2.44) is striking. The relation 
p = constant/A 
is immediately suggested. 


This analogy between the variational principles of geometrical optics 
and particle mechanics had of course been noticed much earlier and had 
resulted in the Hamilton-Jacobi equation discussed in Chapter 1. 
Schrédinger used this equation in his development of the Schrédinger 
wave equation for quantum mechanics but the discussion of the relation 
between the Hamilton-Jacobi and Schrédinger equations is left till 
Chapter 4. 

The first experimental verification of the de Broglie matter waves 
was obtained in 1925 by Davison and Germer who were conducting 
experiments involving the scattering of slow electrons by nickel in a 
vacuum. De Broglie’s formula was completely verified by G. P. Thomson 
using a thin film of gold for his scattering material. 

Another aspect of the relation between particles and waves is 
illustrated by considering the concepts of phase and group velocity. 

The phase velocity Vp of a de Broglie wave is given by 


Vp =vd (2.45) 


where v and A are the frequency and wave-length associated with the 
particle. Writing, with the usual notation 


E=hp and p=hfr 


then 
Vp =E/p. 
In free space, relativity gives 
| es c2 a (2.46) 


v \2 
-@) 

c 
where 7g is the particle rest mass and v is the particle velocity. 


Vp =? /v. (2.47) 
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As energy cannot be transmitted faster than light, v<c and so 
Vp >e. 


The de Broglie waves for a free particle travel with a phase velocity 
faster than light. However there is no conflict with relativity as the 
energy of the waves (or particle) is transmitted with the group velocity 
not the phase velocity. This is explained below. 


Group and phase velocities 
The equation of motion of a plane wave of frequency v and wavelength d 
moving in the x-direction is 


W(x, 1) =A sin anu = =) 


where A is the amplitude. This wave moves with a phase velocity 
Vp =vnr. 


It is possible to form a concentrated wave packet by taking a sum of a 
large number of plane waves with slightly different wave-lengths and 
frequencies. If the medium is dispersive so that plane waves with dif- 
ferent frequencies move with different phase velocities, the wave packet 
constructed moves with a velocity different from the wave velocities. 
This new velocity is called the group velocity. 

The essentials of the argument are obtained by considering the inter- 
ference of two waves of similar but different frequencies and wave-lengths. 
Take 


W(x, t) =A [sin (wt — kx) + sin (w't — k'x)] 
where k, k’ denote the wave numbers such that 
K=2n/k and = k'=2n/r' 
and w, w’ denote the angular frequencies given by 
w=2m and = w'=2nv’. 


It follows that 


. we — k'-k 
W(x, t) =2A cos 5 t- 5 x 
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Figure 2.8 Wave packet. 


Put w’ = w + dw and k’ =k + 8k and then approximately 


wx, t) = [24 cos| t—- *,| | sin (wt — kx). (2.48) 


The combined disturbance represents a carrier sine wave with frequency 
vy and wave number k with an amplitude that slowly fluctuates so that 


Figure 2.9 Groups of waves or beats. 


there are groups of waves with gaps in between. The groups of waves 
move with a group velocity 


) 
Vg = SE where ww) = 20. 
In the limit as 5k > 0 the group velocity becomes 
dw 
= 49 


For the de Broglie waves corresponding to a particle, the total energy 
E and linear momentum p are given by 


E=hv p=hk 
and so the group velocity is 
ae 


ave (2.50) 
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From equation (2.41) 


28 OE 
c? ap P 
and so the group velocity is (from 2.46) 
2 ye? 
V, = Bo = aT =p. 
E c 


The group velocity of the de Broglie waves is equal to the corresponding 
particle velocity and so from (2.47) 


VgVp = 07. (2.51) 


De Broglie waves and the Bohr atom 


Consider the Bohr-Sommerfeld condition for a circular atomic orbit. The 
angular momentum pg satisfies 


$ dd =ngh Ng integer 


2arp = nigh 

where p is the linear momentum. From the de Broglie condition 
p=h/r 

and so the circumference of the circular orbit is 
2ar =ngd. 


The Bohr-Sommerfeld quantization rule is equivalent to demanding that 
the length of the circumference of a stationary Bohr orbit is equal to an 
integral number of de Broglie wave-lengths. This is additional evidence 
that particles have wavelike properties. 


2.6 The New Quantum Mechanics 


Werner Heisenberg 


In 1925 the young German physicist Werner Heisenberg introduced the 
new matrix mechanics. His work was based upon the inadequacies of the 
Bohr correspondence principle rather than the hypothesis of de Broglie 
waves. He found it necessary to introduce non-commutative algebra into 
physics. If g and p represent the position and conjugate momentum for 
a particle then Heisenberg found that in his atomic theory gp is not 
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equal to pq. In 1925 he applied his theory successfully to the simple 
harmonic oscillator and explained the phenomena of zero-point energy. 

Heisenberg found it necessary to introduce a new calculus. Max Born, 
a professor in the university of Gottingen where Heisenberg was working, 
pointed out that this ‘new’ calculus was precisely the calculus of matrices 
previously discovered by the English mathematician A. Cayley. Born and 
a colleague Jordan sought to extend Heisenberg’s work to obtain a new 
theory of the atom. They obtained the important result 


pq — ap =~ ih, (2.52) 


where i is the square root of minus one. The significance of this will 
become apparent in a later chapter (Chapter 5). 

In January 1926 the Austrian Wolfgang Pauli successfully applied the 
Heisenberg theory to the hydrogen atom problem and in the same month 
the young Englishman Paul Dirac produced a more elegant, though more 
abstract, application of the Heisenberg theory to the hydrogen atom. 

He also obtained the relation between the Poisson brackets of classical 
mechanics and the commutators of quantum mechanics and showed that 
there is a very close relationship between classical and quantum 
mechanics. 


Erwin Schrodinger 


At about the same time the young Austrian Erwin Schrodinger was 
developing his wave mechanics which seemed at the time to be com- 
pletely distanct from Heisenberg’s work. He solved the hydrogen 
problem and published his results in the same year as Pauli and Dirac, 
1926. In February he gave the solution for a harmonic oscillator. Bohr 
had found it necessary to postulate the existence of quantum numbers, 
Schrédinger was able to derive them. 

Schrédinger knew that in many classical wave problems, discrete 
frequencies arise naturally when boundary conditions are imposed. 
Consider the vibrations of a string of length / with both ends fixed. 
This transverse displacement (x, f) of an elastic string lying along the 
x-axis obeys the wave equation 

ao 8361 876 

ax? 2 a2 0 (2.53) 
where V, is the phase velocity of the wave and f is the time. Variable 
separable solutions that satisfy the boundary conditions 


900, =¢% )=0 
exist and have the form 


naVyt 
(x, =A sin an = +e}, (2.54) 
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Figure 2.10 Vibrating string. 


where A, € are constants and n can take the integer values 1, 2,.... The 
solutions (2.54) are called ‘eigenfunctions’ with corresponding discrete 
‘eigenfrequencies’. 


Vy =nV>/2L 


Schrédinger was able to set up the wave equations governing the 
motion of a particle. His equation is obtained below although the argu- 
ment given is in no sense a proof. Wave mechanics cannot be derived 
from classical mechanics but must be postulated as in Chapter 3 of 
this text. The truth of the postulates depends upon the accuracy of the 
predicted results. 

From classical mechanics, the total energy of a particle of mass m in 
a conservative field is given by 


p? 
= + x 
E ren (2.55) 


where p is the particle momentum and V is the potential energy. The 
particle momentum can be written 


p=tV/2m(E — V). 
The phase velocity of the de Broglie matter waves is 


yee 
P W2m(E-V) 
By combining the wave equation (2.53) and this phase velocity 
expression the wave equation for matter can be written 
2mE-V) d?v _ 
E? ar? 


Vp (2.56) 


Vey (2.57) 
where V(x, y, z, ¢) denotes the amplitude of the matter wave at a point 
(x, y, z) at time ¢. 

States of constant energy correspond to states of constant frequency 
and for these the solutions can be written 


W(x, y, z, = WO, y, 2) exp [—2zivt]. (2.58) 
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Equation (2.57) becomes 


E 
2m|{. : V\ r 


Vert nv? W =0. 


Using the Einstein relation £ = hv then finally 


nh h? 
—5 7 Wt VY HE’ or —5~ Vw += Ev. 
” (2.59) 


This final equation is known as Schrédinger’s time-independent wave 
equation and gives the states of constant energy. 


Linear box 


Consider the idealized problem of a particle restrained to lie in a linear 
box of length / by walls of infinite potential. Suppose 


V(x) =0 0<x<il 
= 00 otherwise. 


The Schrédinger equation is 
————, =Ep O0<x<l 


The boundary conditions to impose are 
¥O)=¥@)=0 


as the particle must lie within the box. The eigenfunctions (allowed 
solutions) are 


Vale) =A sin n=1,2,... 


x=0 x=l 


Figure 2.11 Linear box. 
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where A is a constant and the eigenvalues are (allowed energy values) 
n>? 


n= pe n=1,2,.... (2.60) 


The application of boundary conditions to the Schrédinger equation 
naturally results in discrete energy values and hence in quantum 
numbers. 


The time-dependent equation 


The time-independent equation (2.59) is sufficient for discussing states 

of constant energy but a more general equation is required for describing 
other states. It is clearly necessary to eliminate the energy FE, For solutions 
of the form (2.58) 


aw 
EW = +ih — (2.61) 


and combining this result with the time independent equation (2.59) the 
time-dependent equation 


fos aw 
—_ + —: —, 
Teo per Fr (2.62) 


is obtained. 

The derivation of the Schrédinger equations (2.59) and (2.62) given 
above is not a proof but is given simply to indicate how they were first 
revealed. A formal derivation based upon the postulates of quantum 
mechanics is given in the next chapter. These equations are non-relativistic 
and correctly describe the motion of a particle only at speeds small com- 
pared with the velocity of light. Paul Dirac (1928) established the relavistic 
equation. 


The wave function 


The complex wave function V(x, y, z, t) is a solution of Schrédinger’s 
equation and in quantum mechanics takes the place of the classical 
particle trajectory. W is some measure of the probability of finding the 
particle although this probability cannot simply be W itself since Y can 
be negative. Max Born gave the correct interpretation in 1926. He said 
that 


W*(x, y, z, OU, y, 2, bxdy5z = |W |? 5xby hz (2.63) 


is the probability of finding the particle in the volume element 6x dy 6z 
about the point (x, y, z) at the time ¢, when a large number of position 
measurements are made on independent single-particle systems each of 
which is described by the wave function (x, y, z, 2). 
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As the total probability of finding the particle somewhere in space 
must be unity the wave function must be normalized to unity, i.e. 


fure, yp, 2, t) V(x, y, z, Hdt=l. (2.64) 


The normalized solutions to the linear box problem are obtained from 
(2.59) to be 


bn = [7 sin n=1,2,.... 


The corresponding probability density 


nuix 


p(x) = VE &) na) = - sin? ; 


and in the large quantum number limit n > © the average value of p(x) 
over any interval 6x is 


1 
pats) = T (2.65) 
From Bohr’s correspondence principle this should agree with the 
probability density of a classical particle in the linear box. To establish 
this, consider a classical particle moving along the x-axis with constant 
speed v between reflecting walls at x = 0 and x =1 The probability of 


finding the particle in the element 6 x is proportional to the time the 
particle spends in this element and so 


bx 
P(x)dx =k 7 
where k is a constant of proportionality. As 


I 
{ P@)dx=1 
0 


then 

k=l 
and so 

P(x) = 1/1 


in agreement with (2.65). 


The equivalence of wave and matrix mechanics 


The matrix mechanics of Heisenberg and Dirac was developed from the 
classical Poisson brackets of Hamiltonian mechanics whereas the wave 
mechanics of Schrédinger was developed from classical wave dynamics. 
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The first steps in showing the equivalence of the new matrix and wave 
mechanics were taken by Schrédinger himself. In 1926 he discovered 
that there is a very simple connection between Hamiltonian dynamics 
and wave mechanics. 

Consider the classical Hamiltonian H for a ‘one-dimensional’ particle 
of mass m moving in a conservative field. If F is the total energy then 


ps 
H=52+V(x)=E 
ae (x) 


where P,, is the momentum and V(x) is the potential energy. Schrédinger 
suggested replacing the independent variable by itself and also replacing 
the momentum P,, by the differential operator —im d/dx. A quantum 
mechanical Hamiltonian # is then defined by 


h? 2 


d 
= —— + 
x 2m dx? EQ) 


Note that # is an operator. The Schrddinger time-independent equation 
(2.59) can then be written 


Ay =Ey. 
Schrédinger also realized that the strange result 
DxX — XPy =—ih (2.52) 


inherent in Heisenberg’s theory then followed as 
d Pee eee 
aa aa aa qe +XPx. 


Later in the same year Dirac showed the complete equivalence of the 
matrix mechanics and the wave mechanics. They are both different 
aspects of the new quantum mechanics (see Chapter 6). 


The uncertainty principle 


In 1927 Heisenberg expounded his famous “Uncertainty Principle’. In 
general terms this states that it is impossible to know precisely the exact 
position of a particle and its momentum simultaneously. When the 
position is measured the momentum is disturbed and, unlike classical 
mechanics, the disturbance cannot be allowed for. It can be shown that 
if Ax and Ap, represent the uncertainties in the x-coordinate and x- 
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component of linear momentum then 
AxAp, ~h 


This will be formally derived in Chapter 5 and is a consequence of 
Heisenberg’s relation (2.52). A simple wave-packet ‘explanation’ will 
suffice here. Consider the de Broglie-wave packet (2.48) consisting of 
the superposition of two plane waves. The distance between the maxima 
of the modulated wave is 


Ax =2n/Ak 
or 


Ax Ak = 20. 


In general, if a group of waves of spread Ax and mean wave number k 
is analysed into its constituent plane waves, then the wave numbers of 
the constituents will be found to be clustered about the mean value k in 
a range AK such that 


AxAk~ 1. 


After de Broglie 
Ap, =hAk 
and so 


AxAp, ~h. 


Ax is the uncertainty of position of the particle described by the group 
of waves and Ap,, is the corresponding uncertainty in momentum. To 
specify the position exactly so that Ax = 0 an infinite number of plane 
waves would be required and so Ak > ©. On the other hand, to specify 
the momentum completely only one wave must be used and the wave 
packet will be of infinite extent so that Ax > °. 


Further developments 


The new theory developed very rapidly after 1926. In 1927 Dirac gave a 
quantum mechanical description of the electromagnetic field. Spin was 
introduced into quantum mechanics by Pauli and the English physicist 
C. G. Darwin and in 1928 Dirac gave a relativistic theory of quantum 
mechanics from which spin arose naturally. The reader is referred to 
Hoffmann’s The Strange Story of the Quantum for an interesting non- 
mathematical account of the origins of Quantum Theory. 
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PROBLEMS 


1 Suppose radiation of wave-length \ forms standing waves inside a 


cubical box of side length a with perfectly reflecting walls so that the 
walls can be taken as nodes. Explain why 


ny 3 =a 
r 

na > =am 
r 

ng an 


where 71), 12, N3 are positive integers and where J, m, n are the direction 
cosines of the radiation direction relative to a set of orthogonal axes 
coincident with three adjacent sides of the cube. Using the relation 


+m? +n? = 


show that the frequencies of the allowed modes are given by 
l 
=(y2 4n2 472)? 
ve(njitnyztn 
(nj +n} +n3)!? 


where c is the velocity of the radiation. 

Construct a set of orthogonal axes corresponding to the variables 
n,c/2a,nzc/2a,n3c/2a. Each of the points in the cubic lattice in the 
positive octant corresponds to an allowed frequency mode. Deduce that 
the number of allowed frequencies in the range between v and p + dp is 


1 4nv? 


and that the total number of allowed modes per unit volume after 
consideration of polarization directions, is 
2 
v 
8a —5 bv 
eo 


bp 


(This result can be used to obtain the equation (2.1).) 


2 Consider the Planck law (2.10) for the radiation emitted by a black- 
body radiator. Show that in the high frequency limit v > © that 


Srhv® 
EQ)= a eee 


This is Wien’s experimental law. 
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3 In free space the electric field flux out of a closed surface s is given by 
$e .nds =q/€o 
s 


where @is the electric field, n is the unit outward normal from the 
surface, q is the included charge and €g is the permittivity of free space. 
The electrostatic potential ¢ is defined to within a constant by the 
equation & = —grad ¢. Show that ¢ cannot have a maximum or minimum 
value except at points where there is a positive or negative charge respect- 
ively. Hence deduce that a free charge cannot be in stable equilibrium at 
a point unoccupied by charge. This result implies that no stationary 
arrangement of charges can be in stable equilibrium under their own 
influence alone. 


4 Use the Bohr theory of the hydrogen atom to show that the radius 
of the first Bohr orbit is 
e 


40 2Rhe(4neo) 
where R is the Rydberg constant and that the energy of this state is 
E = Rhc = 13-6 eV. 
5 Consider a free simple rotator composed of two point masses M dis- 


tance 2a apart. Show that if the rotator is free to rotate in a plane about 
the centre of mass, the total energy is 


E = Ma?w? 


where w is the angular velocity. Use the Bohr-Sommerfeld quantization 
rule to show that w can only take the values 


where J is a positive integer or zero. 

For a diatomic molecule a reasonable estimate for a is the radius of the 
first Bohr orbit (see problem 4). Show that the allowed energy levels of 
the rotating system are 


= ™\ 72 
E=Rhec (i)? 


where R is the Rydberg constant and m is the electron mass. 
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6 Consider a plane ‘matter’ wave of frequency v and wave number k. 


k 
W(x, t)=A exp |-i2" (» - +)| . 
2a 
Rewrite this equation substituting for v and k using the relations 
E=hv and Dx =hk 


where E£ is the energy of the particle and p, is the momentum. Deduce 
that 


ene 
ae ax Px 


and 

ow 

ih— =EW., 

ar 
Hence show that the classical energy expression for a free particle 
1 
E=~—p?* 

2m 


can be expressed as the wave equation, 


wy ov 
2m ax? at 
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CHAPTER 3 
Quantum Mechanics I 


3.1 Introduction 


In the previous chapter the early attempts to account for the breakdown 
of classical mechanics were briefly outlined. The new theory was clearly 
not satisfactory and it was left to Schrédinger, Heisenberg and others to 
develop the quantum mechanics. Historically quantum mechanics was 
evolved using intuition and analogy with classical mechanics. Even so, 
there is a basic difference in philosophy between them, and quantum 
mechanics can certainly not be derived from classical mechanics. 

For a system obeying classical mechanics, (this is an approximation, 
although it is very often a very accurate one), if all the positions and 
velocities of the particles are known at some instant then the state of the 
system at some later time is completely determined by Newton’s laws. 
This is not the case for a system obeying quantum mechanics. In fact the 
position and momentum of a quantum particle can not both be known 
exactly at the same instant and even the concept of a particle trajectory 
loses its meaning. There is a limit to what can be known about the state 
of a quantum system. In quantum mechanics the physical quantities such 
as energies, momenta etc., which can in principle be directly measured 
are called observables but not all the observables of a system can be 
measured simultaneously: If some observable is measured, this act of 
measurement may disturb the system and change the value of some other 
observable. This is true for both classical and quantum systems, but there 
is a basic difference in outlook. The disturbance in a classical system 
produced by the act of measurement can in principle be allowed for 
exactly but not in quantum systems. The most complete description 
possible of the state of a system in quantum mechanics is less detailed 
than that in classical mechanics. Because of this, the behaviour of the 
quantum system at some future time is not uniquely determined. 
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Classical mechanics is based on Newton’s three laws. These cannot be 
proved directly but are verified by the experimental results obtained 
from ‘classical systems’. Similarly, quantum mechanics is based on 
postulates. At first the postulates seem strange, but their validity is 
confirmed by the agreement of the theory with experiment. This postu- 
late statement of quantum mechanics is the most effective introduction 
to the subject and this is given below. 


3.2 The Postulates 


Postulate I 


Any state of a system with n degrees of freedom is described as completely 
as possible by a wave-function Y(q,, . . ., In, £) Which depends upon the 
co-ordinates g; and the time. W is in general complex and as it has to be 
physically meaningful it is single-valued. The wave-function may be 
multiplied by an arbitrary complex number without any essential change 
in its physical significance. It will be shown that Y and its derivative are 
continuous except at a certain number of points and in general it is also 
quadratically integrable, i.e. 


fury dr=k? k finite and real (3.1a) 


where Y* is the complex conjugate of and the field of integration is 
over all values of the co-ordinates. As the wave-function can be multi- 
plied by a constant without changing its character, it is often convenient 
to describe the state by 


1 
VW ,=—¥. 
1k 
This new wave-function is ‘normalized’ in the sense 
J tv, dr=1 (3.1b) 
and once normalized remains so for all time. ‘Y¥, remains normalized if it 
is multiplied by any complex number of modulus unity. 


Definition 1. Linear operator 


An operator transforms one function into another. A well-known 
example is the differential operator D = d/dx, e.g. 


Dx? = 3x?. 
An operator @ is said to be linear if 
a, + ¥2)=a¥, tab’, and a@yv)=aaV (3.2) 
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where W,, W, are arbitrary functions and a is an arbitrary constant. The 
operator D is linear. 

A simple example of a non-linear operator is the square operator which 
will be denoted by S, i.e. 


SY =? 
then 
SQW, + 2) = (H+ ¥2)* 
FSW, +S. 


Definition 2. Hermitian operator 


Suppose Y, and V2 are arbitrary quadratically integrable functions. The 
linear operator @ is said to be Hermitian if 


[¥t@w,) dr= fen)" ar (3.3) 


the integration being taken over all values of the co-ordinates. Clearly a 
linear combination of Hermitian operators «a, 8 is itself a Hermitian 
operator and any power of a Hermitian operator & is Hermitian, e.g. 


[vt(a+6y¥, dr=[vfaw, dr+ {view dr 


=f Vj)" dr + | ¥o(6W,)* dr= [Ho [@+ Bi] * ar. 
(3.4) 


Postulate 2 


With every physical observable there is associated a linear operator. This 
linear operator is Hermitian. 

Of course it is essential to determine the precise form of the operator 
a which represents the observable A. It is at this point that analogy with 
classical mechanics is important. Suppose a and 8 are two quantum 
operators representing variables A and B. The commutator of these two 
operators is defined to be 


[Note; the r.h.s. of (3.5) is not in general zero. Suppose a = x and 6 = d/dx 
then 


cB ge Ne) ee men 
* ax] |" ax dx | |? dx dx 
Commutators have many of the properties of Poisson brackets, e.g. 


[e, 6] =—[8,a], [a,a] =0. 


The quantum operators a, 8 are such that their commutator is propor- 
tional to the classical Poisson bracket of A and B. The constant of 
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proportionality is a universal constant (with the dimensions of ‘action’, 
i.e. Joule-sec.) and introduces the Planck constant h into quantum 
mechanics. 


[a, 8] =ih{A, B} nee i=V-1. (3.6) 


(If there are any variables remaining on expansion of the Poisson bracket 
they are to be replaced by operators.) 

The form of the operator is not uniquely determined by (3.6) but 
depends on the ‘representation’ used. In the commonly used Schrédinger 
representation any position co-ordinate q; or the time t is represented by 
qj OF t respectively. The momentum p; conjugate to q; is represented by 
the operator 


Hee 
i dq; 
It is easy to verify that the commutator of the operators representing 
qj and pj; is 
ha 
ivan A det (UTR 3.7 

[es é 7 F G7) 
and by comparison with (1.30) it is seen that (3.6) is indeed satisfied. It 
is obvious that the operator q; associated with the co-ordinate q; is 


Hermitian. Similarly it is not difficult to show that the operator 
representing the momentum p; is Hermitian. On integrating by parts 


filial Wea ak a) 2 ys 
jui(’ A WV, dq = VI V2 + | Ve ar Sa 


and if the wave function vanishes at infinity the first term on the r-h.s. is 
zero and the result is proved. Any classical observable (such as energy), 
which is a function of the co-ordinates, momenta and time is represented by 
an operator obtained by substituting 

hd 


> a amperes t>t. 3.8 
9:7 4% Pi i ai (3.8) 


For example, any function of p;, f(p;) is represented by 


and similarly for f(q;). Any functions of conjugate variables must be 
carefully ordered before converting into an operator. This is necessary 
because the operators representing the variables (q;, p;) do not commute. 
In classical mechanics it is meaningless to distinguish between p,q; and 
4jP; but it is essential to do so in quantum mechanics. The exact order of 
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the factors in a term can often be decided by remembering the condition 
that the operator be Hermitian. Ambiguity can sometimes be removed by 
taking the mean of two possible orders of the factors. The classical pro- 
duct q;p; (or p;q;) can be represented by the Hermitian operator 


It may be that the exact order can only be decided by trial and com- 
parison with experiment. The formulation in terms of rectangular position 
and momentum co-ordinates is least likely to lead to ambiguity. If necess- 
ary the operator can then be transformed to some other generalized co- 
ordinate system. 

For example, consider the Hamiltonian for a particle of mass m moving 
in a conservative field of force. The classical Hamiltonian is 


1 
ala (pz + py + p2)+ V(x, yz) 
m 


and the corresponding quantum mechanical operator is 


HS a2 


ee Fn \ ox? ay 


92 
By? 1 ag2 | 7 Vy 2)- 

There are other representations apart from the Schrédinger represen 
tation and in these the operators will take a different form. Suppose 
instead of (3.8) the representative operators are taken as 


ho 
ears Pi > Pi aS (3.9) 


The relation (3.6) is still satisfied in this ‘momentum representation’, and 
the quantum mechanical Hamiltonian corresponding to the example 
mentioned above is now 


a a 
H = sp (08 +3 +08) + Vn tha in2), 


The form of the potential function may make this representation difficult 
to use and unless otherwise stated the Schrédinger representation will be 
used throughout this book. 


Postulate 3 


In classical mechanics, Hamilton’s equations determine the time variation 
of a state. Similarly, the Hamiltonian operator for the system determines 
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the time variation of a quantum mechanical state in that the state wave 
function V must satisfy Schrédinger’s time-dependent equation 
HV =ih Lae (3.10) 
ot 
If the state function is known at some initial time, the equation (3.10) 
determines VY at any other time. 

Equation (3.10) is a second order partial differential equation defined 
for all points where the potential energy is finite. As the second derivatives 
exist at all points (where V is finite) then the first derivatives must be 
continuous and so VY must be differentiable and continuous. These 
restrictions on the state functions were mentioned in Postulate one. 


Definition 3. Eigenvalues and eigenfunctions 
If ® is a function and a is an operator such that 
ab =aP (3.11) 


where a is a constant then ® is an eigenfunction of a and a is the corre- 
sponding eigenvalue. The totality of all eigenvalues for an operator is 
called the spectrum of the operator and this may be discrete, continuous 
or partly discrete and partly continuous. The number of eigenvalues of a 
quantum operator is in general infinite. 

For example, the second-order equation 


has solutions X = A sin ./ax + B cos./ax where A, B are arbitrary. So, 
when no boundary conditions are imposed the spectrum of the operator 
d?/dx? is continuous. On the other hand if the boundary conditions, 


X(0)=0 X(L)=0, 


are imposed, the spectrum is discrete and the eigenvalues are 


nn \? 
an =\- n=1,2,... 


with corresponding eigenfunctions 


nu 
— x. 


X,(x) =A sin L 


Definition 4. Orthogonality 
Functions forming a set are mutually orthogonal if, for all pairs, 


JPFm dr=0 n#¥m, (3.12a) 
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the integration being over all the values of the co-ordinates. If in addition 
the functions are normalized to unity (3.1b) then 


[O%On dr = bam: (3.12b) 


The functions are then said to form an orthonormal set. 
A well-known example of orthogonal functions is the set 


1 
®, (x) = — sin ntx 
nl ) Van 
the members of which are orthogonal over the range 0 <x < 27, i.e. 
an 


| G:®n dx = Sam: 
0 


Theorem 1, Reality of eigenvalues 
The eigenvalues of a Hermitian operator are real. 


Proof 


Let the Hermitian operator be a and ®,, an eigenfunction belonging to 
the eigenvalue a, i.e. 


a®, =a,2, (i) 

| @* ab, dt = ap, i} @*®,, dr. 
Take the complex conjugate of (i) 

(ab,)* =anPp 

f@@,)*4, dr=ar [ O28, dr. 
As the operator is Hermitian 

[otad, dr= [(b,)*@, dr 

Qn =aq 
and the eigenvalue a,, is real. 


Theorem 2. Orthogonality of eigenfunctions 

Any two eigenfunctions of a Hermitian operator, belonging to different 
eigenvalues are orthogonal. 

Proof 


Let the Hermitian operator be a and the eigenfunctions be ®,,, Py», with 
corresponding eigenvalues a, , 2), i.e. 


ad, =an,P, (i) aby, =Am Pm (ii) 
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From (i) 
[ona®, dr=a, [o%e, dr. 


Since a is Hermitian 
fonat, dr = fe, (a®,_)* dr 


=[&,a%,6m dr (from (ii) 
=Am f O*,®,, dr 
aS dy, is real. 
@n — am) { 6*.6,d7 =0 
and as a, #@,, then 


Onn dr=0 n#m. 


Postulate 4 


Suppose the wave function ¥(q1,.-.» Qn, t) describing a state is an eigen- 
function of the operator a representing the observable A. i.e. 


av =a. 
Then in this state an exact measurement of the observable will yield 
precisely the constant real value a. (As a is Hermitian a is real.) 
The time-independent Schrodinger equation 


When the potential energy of the system is not a function of time, the 
time-independent Schrodinger equation (3.10) has variable separable 
solutions of the form 


V1, oe Ony t) = yv@, sees Qn)T(t). (3.13) 


(Note, the Greek yp is used to designate the spatial part of the state 
function Y.) Substitute (3.13) into (3.10) and divide through by V 


ad) -tOrE (3.14) 
E is a separation constant independent of q; and tf, ie. 

Y=ye aA (3.15) 
with 

HY; = EV; (3.16) 


is a solution of (3.10). 
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Equation (3.16) is the time-independent Schrédinger equation. 
(also W;) is an eigenfunction of #; Ej; is the corresponding eigenvalue 
and is the energy of the state described by W;. Such states in which the 
energy has a definite value are called stationary states of the system. 

Consider a free particle of mass m moving along the x-axis with no 
force acting. The classical Hamiltonian is p2/2m and the Schrédinger 
equation governing the motion is 


ay ov 


ca ear er i or (3.17) 


The spatial parts y of the variable separable solutions satisfy 


and so 
w(x) =A4 elk +B oe ikx 
with 
2,2 
E= nk, 
2m 


The boundary conditions would determine the arbitrary constants and 
also the allowed energy values. The complete variable separable solution is 


U(x, t)= Ad -F 1) 4 pete +5 9. (3.18) 


Observe that the two separate solutions are also eigenfunctions of the 
momentum operator (#/i)(d/dx) with eigenvalues tk and represent 
uniform motion in the positive and negative directions respectively. 


Definition 5. Linear independence 
A set of functions {¢; } is said to be linearly independent if the relation 
101 +426, +... +46, =0 (3.19) 


implies that the constants a; = 0 for all i. This means that none of these 
functions can be expressed in terms of the others. 


Theorem 3 


Any set of mutually orthogonal functions is linearly independent. 


Proof 
Let the orthogonal set of functions be { ¢;} and suppose that (3.19) is 
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satisfied. Multiply equation (3.19) by o and integrate over all values of 
the co-ordinates. 


-| oF + : =0. 
2, 01 | ots ar ay |; % ar 0 
As the functions are orthogonal then the first term vanishes and so 
qj { $9 dr =0. 


Clearly this integral is non-zero and so aj = 0. This is true for all j and so 
the orthogonal functions are linearly independent. 

Conversely, if a set of non-orthogonal functions are linearly indepen- 
dent and are also quadratically integrable, it is always possible to form a 
mutually orthogonal set from them by a suitable linear transformation. 
This is the Schmidt method. Suppose {0;} is the non-orthogonal set, 
then an orthogonal set {¢;} can be defined as follows. 

Let $; =6,, and then choose a with $2 =@2 + a, so that $2 is 
orthogonal to ¢), i.e. 


a=— j 0192 ar] { $79) ar. 


Next choose B;, 82 with $3 = $3 +B1$1 + B2$2 So that $3 is orthogonal — 
to oy and o2 etc. 


Definition 6. Function space 


The function space defined by the linearly independent set of functions 
{0 ;} includes all functions of the form 


B10, +b202 +... t+bnPn (3.20) 
where b;’s are any complex constants. The number of linearly indepen- 
dent functions used to define this space, gives the dimensionality. Clearly 
the orthogonal set of functions {¢;} obtained from {6;} by the Schmidt 
process define the same space. Either of the basis sets {¢;} or {0;} are 
said to span the space in that any function in the space can be expressed 
as a linear combination of the members of either set. This definition is in 
accordance with the more usual ideas of Cartesian space. The unit vectors 
1,, 12,1, are orthogonal vectors and are also linearly independent. 
Together they define a three-dimensional space which includes all vectors 
of the form 


r=cyl, teh +¢313. 


The space defined by the functions {x;} is said to be a proper sub- 
space of the function space {6;} if every function in the sub-space 
belongs to the space {6;} and if the converse is not true. 

The plane defined by the unit vectors l,,1, is a proper sub-space of 
the three-dimensional space given by lj, ly, 13. All the vectors in the 
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sub-space are of the type 


r=cl,+c.h, 


and obviously belong to the full Cartesian space. Alternatively, any 
vector including 1, while belonging to the three-dimensional space does 
not belong to the sub-space. 


Definition 7. Degeneracy 


There may be several eigenfunctions of an operator belonging to the 
same eigenvalue. If there are n linearly independent eigenfunctions 
corresponding to a given eigenvalue, the eigenvalue is said to be n-fold 
degenerate. These eigenfunctions can always be chosen to be orthogonal 
by the Schmidt process. 


Postulate 5, Expectation value of an operator 


Suppose there are a large number of identical systems each described by 
the same normalized state function, W. If V is an eigenfunction of the 
operator a representing the observable A then an exact measurement of 
A will yield the corresponding eigenvalue. In the more general case the 
state function is not an eigenfunction of a. It is now postulated that an 
exact measurement of A must yield one of the eigenvalues. In general, 
different values will be obtained for each system and the average value 
(sometimes called the expectation value), of the results is 


(A)= i) W*a dr. (3.21) 


Probability density 


From (3.21), the expectation value of the co-ordinate q; is 
(qi)= fUtqyY dr 


if Y is normalized. This implies that ’*W dz represents the probability 
that the co-ordinates lie between q; and q; + dq;. That is, ¥* Wis a 
probability density. 

For a single particle system, the wave function depends upon the 
co-ordinates of the particle x, y, zand Y*W represents the probability 
density of the particle. It is no longer possible to state with certainty 
the position of the particle. 


Postulate 6. Completeness of eigenfunction sets 


Postulate 1 says that every state of a system is described as completely as 
possible by a wave function. At any instant, all such state functions 
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belong to some function space which is characteristic of the system. 
(This space is an abstract Hilbert space. The reader is referred to Quantum 
mechanics, by G. L. Trigg.) As the Hamiltonian operator has an infinite 
number of linearly independent eigenfunctions each of which can 
represent a state of the system, then it is clear that the function space is 
of infinite dimension. 

Suppose Wj is a state function of the system and is also an eigen- 
function for the operator representing some observable of the system. 
It is now postulated that the infinite set of all such orthogonal eigen- 
functions span the system function space. That is, the set of eigen- 
functions is complete in that any state function can be expressed as a 
linear combination of them. For the moment it will be assumed that the 
operator spectrum is discrete and 


W= DoW. (3.22) 
J 


If the eigenfunctions are quadratically integrable and are normalized 
the expansion co-efficients 


oy = J UP W ar. (3.23) 


Equation (3.22) shows that any state can be expressed as a superposition 
of other states. 

The reader will no doubt have met the concept of a complete set of 
functions in the theory of Fourier series. The functions 4, sin nx, cos nx, 
n=1,2,... forma complete set in that any piecewise continuous 
function f(x) with period 27 can be expressed as a linear combination 
of them, i.e. 


a oo 
fe=z + > G, cos nx +b, sin nx) 
ri 
with 


Tv nT 
1 1 
an =~ | f(a) cosne dx and bn =~ | fG8)sinnx dx 
-T 


Theorem 4 


From the previous postulate, any state function Y can be expressed as a 
linear combination of the orthonormal eigenfunctions {¥;} of the 
operator a representing the observable A (3.22). The corresponding eigen- 
values are a;. An exact measurement of A must yield one of these eigen- 
values and the probability of observing a; is 


P; =cpC;= le; |?. (3.24) 
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Proof 


Expand W as a linear combination of the orthonormal eigenfunctions 
{Wj}. From (3.21) the expectation value of A is 


(A)= [detvia Leh; dr= 2S cheyy J winy ar = © le, Pay. 


So |c;|? can be taken as the probability of observing the eigenvalue a;. 
Note that if V is normalized 


[wey ar = Sle? =1. (3.25) 
i 


If the value a; is obtained on measuring A then assuming the eigen- 
value is non-degenerate, the act of measurement must have disturbed the 
system changing the state function from W to the eigenfunction W;. In 
general this function will change with time but if the observable A is 
immediately measured again the value a; will once more be obtained. 

If the eigenvalue is degenerate, (3.24) must be summed over the 
degenerate eigenfunctions. 


Continuous spectrum 


The equations (3.22-3.25) assume the operator has discrete eigenvalues. 
There are operators however which have eigenvalues running continuously 
over all or part of the spectrum. Suppose a is an operator for which the 
eigenvalues are continuous. (An example is the position operator.) The 
generalization of (3.22) for an arbitrary wave-function W is 

v= [c(@)¥, da (3.26) 


where the integration is over all permissible values for the eigenvalue a. 

The problem of finding the expansion coefficients is complicated by 
the fact that the eigenfunctions Y, are not quadratically integrable. If 
the wave-function W is quadratically integrable then from (3.26) 


fuev dr= [J ercanudt da]v dr = fer@|furw dr] 140307) 


assuming the order of integration can be reversed. It would seem reason- 
able that |c(@)|? da should represent the probability that the observable 
has a value between a and a + da and as the sum of all probabilities is 
unity (cf. (3.25)) 


[le@P da=1. (3.28) 
If © is normalized to unity 


fora] [wiv dr| da= } c*(a)c(a) da 


c(@)=| ¥E¥ ar. (3.29) 
Compare this result with (3.23) for a discrete spectrum. 
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The reader is asked to compare (3.26) and (3.29) with a function 
f(x) satisfying [.. |f(x)|? dx finite, and its Fourier transform g(y). 


pe an 
I@)= 


with 


ico el dy 


g(y)= —— =| e f(x) dx. 


It is now shown that the eigenfunctions of an operator with a con- 
tinuous spectrum are not quadratically integrable. From (3.26) and (3.29) 


c(a)={ wfc’ wa da’ dr. (3.30) 
Reversing the order of integration 
e@)= fea’ j eV, dr da’. (3.31) 


(Strictly this step is not permissible since the inner integral in (3.31) is 
divergent.) For this result to hold true for arbitrary wave-functions then 
the value of c(a) cannot depend upon the value of c(a’),a' #a, ie. 


futwe dr=0 a’ #a (3.32) 
In addition, to satisfy (3.30) and (3.31) 

} [| WIV, dr] da’ = 1. (3.33) 
These results imply 

J wy, dr=%. (3.34) 


The ‘improper’ function with these properties is the Dirac delta function 
5(x) ‘defined’ by 

5(x) =0 x#0 

5 (0) = 00 


fs@ dx = 1. (3.35) 


Clearly the delta function is not analytic in the usual sense and a 
more formal definition may be given as the limit of a sequence of 
analytical functions satisfying (3.35). There is not a unique sequence 
with this property but a useful one is 

sin? ex 


6@)= li ‘ 
() reat nxe G 30) 
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sin?ex 


0 zt x 
& & 


Figure 3.1 Delta function sequence. 


The reader is asked to confirm that (3.35) is satisfied. Note that 
co . 2 
{ sin? B vo 


Strictly the delta function is only meaningful under an integral sign and 
the limiting process taken after the integral is evaluated. 
From (3.32), (3.33), (3.34), (3.35), 


[wrt dr=5(a' — a). (3.37) 


Equation (3.37) is the generalization of (3.12b). The eigenfunctions 
for different eigenvalues are still orthogonal but they are not quadratically 
integrable and cannot be normalized to unity. 

The corresponding result in Fourier transform theory is 

te ey eixy' dy =8(y—y’). 
2a 


PROBLEMS 


1 Classically, the angular momentum of a particle with respect to the 
origin is L =r A p where p is the linear momentum. Show that in the 
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Schrodinger representation the x-component of the angular momentum 
operator is 


_af{a a 
aan (3-25), 


Confirm that the commutator 
[ Fx; Ly) =ihL,. 


2 The classical Hamiltonian for an electron (charge —e) in an electro- 
magnetic field is 


sl 
H= 1 @+eay —ed. (1.45) 


When expanded this becomes 
2 


p 


e e 
H=-—+—p.A+—— 
2m m? 


2 

A*— ed. (a) 
2m 
Alternatively it could be written 

2 2 
p e e 4 
=+—+—(p.A+A.p)+—A* — eg. 
Se ome A-p)+>7 eg. (b) 

Write down the corresponding operators using the substitutions (3.8) and 
show that (b) produces a Hermitian operator whereas (a) does not. The 
operator obtained from (b) is the correct quantum-mechanical Hamiltonian. 


3 The classical Hamiltonian for a linear harmonic oscillator is 


1 
H==— p* ++ mex? 


2m 2 
Derive the Hamiltonian operator in both the Schrodinger and momen- 
tum representations and write down the corresponding time-independent 
Schrédinger equations. Explain why the eigenfunctions of these two 
equations are identical in form. 


4 Schrédinger’s time-independent equation for a particle moving along 
the x-axis in a conservative field is 


a? d? 
— 2 Set Ve HEV 


where V(x) is the potential energy and £ is the energy. Suppose there is 
a potential well defined by 


=—W —tSx<t 
V=0 |x| >t. 
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Integrate the Schrédinger’s equation across this well. Show that if 
V > —coas ¢ > 0 in such a manner that 2Vt = —k (finite) then 


ay dy\ __ 2m 
(=), = a) red 


wv and a 
dx r dx i 
are the right hand and left hand derivatives at the origin. This result 


illustrates that the derivative of the wave-function is not continuous at a 
singularity in the potential. 


5 Consider the functions 
Witx)=e™*, a(x) =e", x(x) =e 3%, ete. 
defined in the interval 0 < x <°. Show that these functions are quad- 


ratically integrable in this interval and normalize them. Using the Schmidt 
method obtain an orthonormal set of functions. 


6 Show that the function space (x3, y3, x3 + y3) has the dimension two 
and not three. 

Similarly show that the functions space (xy?, yz, z2x, x (y? — z?)) has 
dimension three and not four. 


7 A periodic function with period 27 is defined by 
f(x) =x —n<x<T 


Show that the associated Fourier series is 


5 (pt! = sin nx 
n=1 


8 Suppose {¢,(x)} is a finite set of n real orthonormal functions in the 
interval (a, b), i.e. 


b 
| 6:9; dx =8y. 
a 
Let f(x) be some real function and define ‘Fourier constants’ c; by 


b 
c= | f@)o@)dx iF 1,200 
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Consider the integral 


b 
7= | [F@)- Ln]? ax 


a 


where L,,(x) is some linear combination of the orthonormal functions 
and is written 


Ine) = 2 116i 


Show that the values of the constants y; that make J least are 
Vi = &i- 


It may be assumed that the integral 


b 
| f(x) dx 


a 
exists. 
Explain why 


b 
ce+cht...t2< [7?@) dx. 
a 


This result shows that the sum of the squares of the Fourier coefficients 
converges. 


9 The general form for a second order, linear homogeneous equation is 
d*x dX 
fo) yz + g(x) a + [n(x) +AK(@)]X=0 


where 2 is a constant. Show that by multiplying through by the integrat- 
ing factor 


1 7& 
—elF dx 


f 


this equation can be put into the self-adjoint form 
d dX 
x [reo “| + [a@)+%W@IX=0. — @) 


10 Suppose X, and X2 are solutions of the self-adjoint equation 
(equation (a) in question 9) corresponding to different ‘eigenvalues’ 
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A, and A, and that both solutions satisfy the same boundary conditions 


dX 
a,X tay ee atx =a 


dX 
b)X+by—=0 — atx=b. 


By multiplying the equation satisfied by X, through by X, and that 
satisfied by X2 through by X, integrating from x =a to x = b and 
subtracting the two, show that provided A; #A, 


b 
p(x)X,X> dx =0. 
a 


That is, the two solutions are orthogonal with respect to the weight — 
function p(x) over the range (a, b). 


11 Legendre’s differential equation is 
dy dy 
1—x?)—5 —2x= +11 + 1)=0. 
(1 —x*) dv? x de ( ) 

Put this equation into self-adjoint form (question 9), and verify that 
r(x) = 1 —x?. Observe that r(1) =r(—1) = 0. When / =n (integer) a 
solution of the equation finite at x = +1 is the Legendre polynomial 
P,,(x). Following a method similar to that used in question 10 show that 

+1 
} P,(x)Pm(x) dx =0 n#m. 
-1 
Note that no boundary conditions need be imposed upon the solutions 
in this case. 


12 The Dirac delta function has the property 


t) 


J 8@) dx=1. 


—9o 


Suppose 5'(x) is defined as the ‘derivative’ of 5 (x). Show that this 
derivative has the property 


fo (x) F(x) dx = (0). 


13 Using equation (3.29) substitue for c(@) in (3.26). Hence derive the 
‘closure relation’ 


f wi’) ¥aG@) da = 6 (G1, — 41)8(Q3 — 42)... 8G ~ Gn). 
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14 Inthe Schrodinger representation, the operator representing the 
co-ordinate q is itself. The corresponding eigenvalue equation is 


qv y= qa. 


q’ denotes the actual value of g. (Of course q has a continuous spectrum.) 
Explain why the eigenfunctions are of the form 


Wy =8(q-q’). 


Using (3.26) expand an arbitrary wave function ¥(q) in terms of the 
co-ordinate eigenfunctions. Show that the expansion coefficient 


e(q')=¥Q’). 
Deduce the probability that the value of the co-ordinate lies between q 
and q' + dq’ is 

Iq’)? dq’ 
(‘¥*W has already been shown to be the probability density.) 


15 The quantum mechanical expectation value for the kinetic energy 
of a particle of mass m is 


A? 
AS Ma arma { wy? y dr 
2m 
where w is the state function and the integral is over all space. Substitute 
for the Laplacian operator using the identity 
div (A grad B) = AVB + grad A. grad B 
where A, B are scalar-functions. 


If the wave function falls off faster than r7! confirm that (7) is real 
and positive. Deduce that the allowed energies of a particle satisfy 


E2 WV) 
where V is the potential energy. 


(Wave functions often fall off as e~°”. Hint: use the divergence 
theorem 


[dive dr= $a. as} 


s 
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Schrodinger’s Equation 


4.1 The Classical Limit 


The fundamental equation of classical mechanics is the Hamilton-Jacobi 
equation, 


(1.62) 


oS 8S \_ as 
8q,7 °° 8Gn’ ot” 


alan. +9 ns 


For a conservative system, the Hamiltonian is time-independent and the 
equation reduces to 


as as 
H (d15--54m 3+ FE : 
(« qm 5a, 1. (1.63) 


where F is the constant energy of the system. 

There is an obvious superficial relationship between these equations 
and the basic equations of quantum mechanics. The time-independent 
Schrodinger equation which every state function must satisfy is 


a a2 naw 
yes Ine on ee Ht ee, 
x(a eG aq1 i 8an ; (3.10) 


When the Hamiltonian does not depend on the time there are stationary 
states for which the energy is defined and they are obtained by solving 
the time-independent Schrédinger equation 


h oO h a 
ite sees TT) yy = Epy;- 3.16 
x(a. n> i aq1 i 2) iW; ( ) 
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Quantum mechanics includes classical mechanics as a limiting case and 
the Hamilton-Jacobi equation is valid when @ is small relative to other 
quantities with the same dimensions (i.e. as # > 0). 

A similar situation occurs in electrodynamics where wave optics 
includes geometrical optics as the limiting case when the wave-length is 
small compared to the dimensions of the physical problem (i.e. as A > 0). 
In a dielectric medium with zero charge density and zero current density, 
the components of the electric field E and the magnetic induction B . 
satisfy the wave equation 


2 
Viegas (4.1) 


where u(x, y, z) is the velocity of propagation of the wave. When the 
velocity is a constant there are plane wave solutions of the type 


f = Bellk.r— wt ta). (4.2) 


The direction of propagation is normal to the wave surface that is the 
locus of points which have the same phase at a given time. In the more 
general case there are solutions of the form 


f =B(x, y, z Hel®& » ‘) (4.3) 


where the phase (eikonal) & does not have the simple form of (4.2). In 
the limiting case when ® changes by large amounts in small distances 
(corresponds to A + 0) substitution of (4.3) in (4.1) shows that the phase 
must satisfy the eikonal equation 


a\?_ (ao\?_ [ae a@\? 
GOV" (ORV, gl OPV was (ee \ (4.4) 
ox oy Oz [u?] at 
This is the fundamental equation of geometrical optics. For waves of a 
single frequency w 
@(x, y, z, th=o(%, y, z) — wt (4.5) 


and the eikonal equation is 


ae 2 ao 2 3 2 2 
eye) ey “ 


This equation is analogous to the Hamilton-Jacobi equation for a particle 
of mass m in a conservative field, 


as\? (as\?_ (as\* 
(55) (5) (3) =2m(E —- V). (4.7) 


In geometrical optics the paths of rays are specified by Fermat’s 
principle. One statement of this is that the ray-path is such that the dif- 
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ference between its phases at the fixed end points takes a stationary 
value. In classical mechanics the motion of a particle is governed by 
Hamilton’s principle. All this suggests a correspondence between the 
phase of a wave and Hamilton’s characteristic function S. 

The wave function for a single quantum particle satisfies Schrédinger’s 
equation 


n _, Aoy 
—-—— WW +VY=—-—-_—. 4. 
2m y i ot oe) 

In analogy to (4.3) try a solution of the form 

Ue, yz, = BO, y, 2 Helm (49) 
and write 

S ? ’ , t 
= 5G%n9 Zasy (4.10) 


The constant h is introduced as the phase is dimensionless whereas S and 
h have dimensions of action e.g. Joule secs. Schrédinger’s equation 
becomes 


a? _, ih ih 4, B 5 
ae -— ; = — + 
es VB = grad B. grad S oni BV’S oe. (grad S)* + VB 
os OB 
=———Btih —. 4.11 
oo at Gl) 
In the limit of small #, this equation becomes 
as\?_ fas\? [as\? as 
—} tt] t(—] + =—2m —. 4. 
(5) (=) (3 at ” Ot G12) 


In the same approximation, the time-independent Schrédinger equation 
becomes 


2 2 2 
(#) (5) (F] = 2m(E - V) (4.13) 
0x oy az 

which is the Hamilton-Jacobi equation (4.7). In the limit that terms in / 
can be ignored the wave function changes according to classical rules. 
From equation (3.6), if # can be replaced by zero then all commutators 
are zero. This means that quantum-mechanical operators may be replaced 
by classical variables. 

The brief treatment above has indicated how quantum mechanics 
includes classical mechanics. Historically the approach was quite the 
reverse. Schrédinger argued that if geometrical optics required extension 
to include wave optical effects such as diffraction, then perhaps in 
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analogy, classical mechanics could also be extended. This lead to his 
theory of wave mechanics. 


4.2 One-Dimensional Problems 


In these idealized problems a single particle is confined to move in one 
dimension, along the x-axis, say. They are worth consideration since they 
bring out the essential elements of a quantum mechanical solution using 
only elementary mathematical techniques. In fact only a few real quan- 
tum mechanical problems have a rigorous analytic solution. 


Potential barrier of infinite width 
A simple example of this type of problem is a potential step function. 
The potential energy of the particle is defined by 
V(x) =0 x<0 
=Vyg x>0. (4.14) 


It is desired to find the eigenfunction solutions of the time-independent 
Schrédinger equation 


2 42 
= ss +V(x)W HEV. (4.15) 


Potential 


—-—~— Real part of 
w(x) when 
E<\wh 


Figure 4.1 Step function potential. 
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To the left of the origin the potential energy is zero and there are no 
forces acting. The particles are free and the eigenfunctions are 


W(x) =Aei* +BehiX x <0 (4.16a) 
with 
E=h?k?/2m (4.16b) 


where A, B are constants. 
To the right of the origin 


h? dy 
2m ax? 


+(Vo-—E)y=0 x>0. 


The eigenfunctions are 

W(x) = Celha* + De® = x >0 (4.17a) 
with 

(E — Vo) =h?k3/2m. (4.17b) 


C, D are also constants. 

Solutions of this problem exist only for F > 0. Then the energy 
spectrum is continuous and the eigenfunctions are not quadratically 
integrable although y can still be normalized over a finite volume. If a 
flux of non-interacting particles flows to the right from minus infinity 
y*w can be chosen to represent the particle density. Some particles _ 
may be reflected by the potential barrier and some transmitted. The 
terms Ae!*:* and Be /*s~ in (4.16a) respectively represent the incident 
and reflected particles. There are two cases to consider. 

(a) When E > Vo then classically all particles would be transmitted 
through the barffer. This is no longer true and | B/A|? gives the intensity 
of the reflected wave relative to that of the incident wave. In this case kz 
is real and the term e—?*2* (4.17a) represents a wave travelling to the left. 
Clearly for x > 0 no particles can flow to the left and D must be zero. 

As the potential is finite y, dy/dx are continuous at the origin, i.e. 


A+B=C 
YA — B)k, = Ck. (4.18) 


and 


(& 
Cs siete SIE Bs 4.19 
(4.19) 
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(The absolute values of A, B, C may be fixed by normalizing .) In the 
limit E > ©, B = 0 and C= A. Quantum mechanics is then in agreement 
with classical mechanics and all particles are transmitted. 

(b) When E < Vo then classically no particles would be transmitted. 
This is also true in quantum mechanics. In this case k3 is negative. Only 
the positive imaginary root is allowed as the negative imaginary value for 
k> represents a divergent solution that increases exponentially with x. 
That is D is again zero. Equations (4.18) and (4.19) still apply (with 
ky =i|k|) and B/A is a complex number of modulus unity. Then the 
reflected wave has an intensity | B/A|? = 1 corresponding to a reflection 
of all particles. Even so there is a non-zero probability that a particle will 
penetrate the barrier to a depth x. From (4.17a) this probability is 


lw(x) I? =|AP? 7 exp [—2\/2m(Vo — E)x/n] x>0. 


Classically this could imply that the particle has negative kinetic energy 
for x > 0. This surprising unphysical result is explained in quantum 
mechanics by considering the uncertainty principle (Chapter 5). 

A useful limiting case occurs as the potential step is increased without 
bound. From (4.18) and (4.19) 


lim B=—A and lim C=0. 

Vo —>oo Vo —> oo 
In this case the wave function becomes 

W(x) = 2iA sink,x x<0, w(x) =0 x>0. (4.20) 
The wave function vanishes at a boundary where the potential is 
infinite. This result is valid in three dimensions. 

e 

Tunnel effect 
Another problem where the energy spectrum is continuous occurs in the 


motion of particles past a potential hill. As an example consider a 
rectangular potential barrier 


V(x) =0 x<0 t 
=Vo O<x<a 
=0 x>a. (4.21) 


Again solutions exist only for E > 0. (Note: the minimum potential 
energy is zero.) 

The most interesting solutions occur when the energy of the particles, 
incident from the left, is less than the height of the potential barrier. 
Classically none of these particles would be transmitted but this is not 
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Yo 


es cnn 
Incident 
particles 


—— Potential 


—-—— Real part of 
w(x) when 
E<vkh 


Figure 4.2 Rectangular potential barrier. 


the case in quantum mechanics. The Schrédinger equations to be solved 
are 


way p,  x<0 
2m dx? x>a 
hn? d? 
— 5 LE + Ved = BY 0<x<a (4.22) 
When £' < Vo, the solutions are 

W(x) = Agikx + Be7ik* x<0 
W(x) = Ce** + De~°* O0<x<a 
W(x) = Felk* x>a 


with 
1 1 
dare (2mE )'2 and a=e [2m(V> - E)}"/?. (4.23) 


a is real. Note that there is no reflected wave for x >a and so there is no 
term involving e~!**, 

The boundary conditions are that the wave function and its derivative 
be continuous at x = 0 and x =a. These determine the values of the 
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constants relative to A. (As in the previous problem the absolute values 
may be fixed by normalizing y.) The continuity of and its derivative 
at x = 0 give 


A+B=Ct+D 

ik(A — B)=a(C — D). 
Similarly at x =a 

Ce%2 + De~™4 = Felka 

a(Ce“4 — De~%*) = ikFe'**. 


These equations can be written 


B20 De 
A A A 
B C 
ea ey ig 
D ug 
a oy —aq = ika__=9 
er a ae” aaa 
c —aa *1-pika 
ot ee — ike’ —=0 
ae A ae ike 
Solving for F/A 


4iak 


F 
a a a ee 4.24 
A é[4iak cosh aa — (a? — k?)2 sinh aa] ot) 


The number of particles transmitted through the barrier is proportional 


to | F/A\?, ie. 
E 
A 


a 4 
= ao ee . 
E cosh? aa + (¢ - 3) sinh? aa 


If aa is large then both cosh aa and sinh aa behave as e®*/2 and 


F\? 16a7k?  _ 
4| “@aeee ae 


The number of particles transmitted falls off exponentially as 
2m 1/2 
exp 2 v0) a}. 


Note that in the case that A can be neglected (i.e. h > 0) the classical 
limit of no transmission is obtained. 
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(b) 


Figure 4.3 (a) The potential function for an o-particle in a uranium nucleus. 
(bo) The potential function in a crystal. 


Tunnelling of particles occurs in radioactive decay. In the case of 
uranium an a-particle experiences the repulsive nuclear electrostatic field 
up to about 107!* m from the nucleus. Inside this distance the forces 
are attractive and the a-particle is in a potential well. It is observed that 
a-particles are emitted with energies less than that necessary to penetrate 
the barrier of the nuclear field. For example in the case of 73$U the 
height of the potential barrier is about 30 MeV whereas the kinetic 
energy of the emitted a-particles is only 4-2 MeV. 

The smaller the mass of a particle the greater the probability of 
transmission. This explains why in metals electrons can move from atom 
to atom through the potential barriers. If the barrier height is taken 
~1 eV and a ~S5 A the barrier is not very transparent and the electron 
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has a probability of about 10-2 of penetrating at each ‘collision’ with 
the barrier. However if it is considered that the electron is oscillating 
inside the potential well with wave-length equal to twice the Bohr radius 
(ag = 5-32 x 1071! m) then from de Broglie’s hypothesis (2.42) the 
electron velocity will be h/2m ag and will collide h/2m az ~ 1017 times 
per second with the barrier. Such an electron will escape through the 
barrier after ~107!5 seconds. Clearly it cannot be considered that such 
electrons are bound to particular atoms but rather that they travel 
throughout the crystal. 

In the case when E > Vo it can be shown in a similar manner to the 
above analysis, unlike the classical case, that not all particles are trans- 
mitted. Some are reflected. 


Harmonic oscillator 


Consider a particle of mass m oscillating in one dimension. Classically if 
the force acting is -kx then the potential energy of the system is kx?*/2. 
The Hamiltonian operator is 


Bd? 2G 
fp ee Se aS 4.26 
amax? 2 G2) 
and the time-independent Schrédinger equation is 
ee ee 
_— — +— = : . 
2m dx? ye ey O22) 


It is usual to introduce the non-dimensional quantities 


_2VmE _ [mk \"4 
and then 
dy 
ae (y-»?) W=0. (4.29) 


The potential energy tends to infinity as | x | > °° and its least value 
is zero. Solutions exist for £ > 0 and the wave function must vanish 
as | x | > ©. It is well known that (4.29) possesses solutions which tend 
to zero as |x| > © if and only if takes one of the values (1 + 27) where 
nis a positive integer. The corresponding solutions are 


Wy) =A e”? HAY) (4.30) 


where H,,(y) is the Hermite polynomial of degree n. (See Special functions 
of mathematical physics and chemistry, by Sneddon.) The energy spectrum 
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is discrete and so the eigenfunctions are quadratically integrable. A is a 
normalization constant. The first four polynomials are 


Ao) = 1 

H,(y) = 2y 

H,(y)= 4y? —2 

H3(y) = 8y? — 12y. (4.31) 


The wave function y,,(y) is odd or even as n is and has nodes. 
Corresponding to the allowed values of y the discrete eigenvalues are 


1 [k 
En (n i) — n=0,1,2,.... (4.32) 
2 m 


The analogous classical oscillator has an angular frequency w = (k/m)!/2 
and in terms of this frequency 


E,=(nt+4)hw n=0,1,2,.... (4.33) 


Pi2= [Ws2l? 


Figure 4.4 Probability distribution for n = 12. The classical value is represented 
by the dashed curve. 
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The term niu gives the Planck series of energy levels; the energy of 
the ground state is not zero but takes the value hw/2. This is calied the 
‘zero point’ energy and is a manifestation of the uncertainty principle to 
be discussed in the next chapter. Classically a particle oscillator would be 
in the lowest energy state when at rest at the centre of the oscillation. 
But this gives a precise value to x and in quantum mechanics it is not 
then possible to give precise values to momentum and energy. For large 
quantum numbers the wave function probability density approximates 
to the classical probability curve for an oscillator with the same total 
energy. 

In this classical case the probability is least at the midpoint of the 
oscillation where the particle velocity is greatest. This is an illustration 
of the correspondence principle discussed in Chapter 2. 

All the oscillator eigenfunctions, except the ground state, have one or 
more nodes. The presence of nodes in a wave function leads to a classical 
paradox. As the probability density is zero at a node then apparently the 
particle must pass from one side to the other at infinite speed as it 
oscillates. The quantum mechanical answer to the ‘node paradox’ is that 
the momentum and position of a particle cannot be known simultaneously 
and so this trajectory problem does not arise. (For a further discussion 
see Kemble, p. 86.) 

The simplest application of the linear oscillator is in the discussion of 
the vibrational energy levels of diatomic molecules. 


4.3. Three-Dimensional Problems 


Particle in a box 


Consider a cube of side a with the origin of the co-ordinate system at a 
corner of the box. Inside the box the potential energy of an electron is 
zero and outside the potential is infinite. This is essentially the model 
for a metal suggested by Sommerfeld in 1928. 

The time-independent Schrédinger equation is 


2 2 2 
ay Fy ay 


ax? * ay? * az? +k?y=0 (4.34) 
inside the box, where the electron kinetic energy is 

E =h?k?/2m. (4.35) 
As EF > 0 then k must be real. 


The partial differential equation (4.34) is solved by the method of 
separation of variables. By writing 


Wy, 2)=X()Y(y)Z@) (4.36) 
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equation (4.34) becomes 
1d?X¥ 1d4?Y 1dZ 
= te ets a tk’ =0. : 
X dx? Y dy? Z dz? ? 437) 


The first term does not include y or z, the second does not involve x or z 
and the third does not involve x or y. This means that 


1 a?Xx 

x ax? *HmO 

ie eee 

Y ay? k2 =0 

1 d?Z 

Z dg te (4.38) 


where k,,, ky, k, are constants such that 


k? =k2 +k? +k?. (4.39) 
Each of the equations (4.38) has a general solution of the form 

X(x) =A sin k,x +B cos kyx kz>0 a) 

X(x)=Ax +B K2=0 = (b) 


X(x) =A sinh |k,|x +B cosh |k,|x K<0 (c) (4.40) 


where A, B are arbitrary constants and of course are not the same in the 
three cases (a), (b) and (c). 

The boundary conditions require that the wave function vanishes at 
the box walls where the potential is infinite. The only solution which 
can satisfy these conditions is (4.40a), and the wave function (4.36) is 
then composed of terms of the type 

sin : sin 
Kyx  kyy zz. (4.41) 


This wave function must vanish at all points on the cube surface and the 
only product satisfying this condition is 
A sink,x sin kyy sin k,z 


with 


hg=—, yt (4.42) 
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where J, m, n are integers. The electron energy takes the discrete values 


2 
pe 5 (1? +m? +n7). (4.43) 
8ma 


It is very important to note that it is the boundary conditions that 
quantize the energy. In general the energy spectrum is partly or wholly 
discrete if a region of space is surrounded by a potential wall. In the 
present problem the electron is localized within a cube by an infinite 
potential barrier whereas in the previous problem (oscillator) V(x) > 
as |x| > ©, The energy spectrum is wholly discrete in both these cases of 
a finite and infinite domain. 

However if the potential is such that V(x) < 0 inside some closed 
region and V(x) = 0 outside this region there is a discrete set of bound 
states for E <0 and a continuous spectrum of states for E > 0. The 
wave functions for these latter states are not quadratically integrable and 
represent ‘infinite’ motion of the system. 

More generally, if the potential tends to a finite value at infinity there 
will be a continuous energy spectrum for values greater than this. 

This problem is the basis of the free-electron model for solids. 
Although this application is apparently crude it is surprisingly useful. 


E>0 Continuous 
spectrum 


E<0 


Discrete 
spectrum 


Figure 4.5 A finite potential well. 
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Rigid rotator 


It is convenient at this stage to discuss the system of two interacting 
point particles the results of which will not only be needed in the present 
problem but also in the following treatment of the hydrogen atom. 
Suppose the particles have masses m, mz and are at the positions (x1, 
¥1,21) and (x2, y2, Z2) respectively and suppose that the potential energy 
is a function of the relative displacement of the particles, i.e. 


V=V(xq —%1,¥2 —¥1,22 — 24). (4.44) 


The classical Hamiltonian for this two-body problem is 


where P,, are the momenta of the particles. The corresponding 
operator is obtained by substituting 


a a 
PP =f? | stats p= deo. 
‘ (= dy? az? : 


The time-independent Schrédinger ‘eit is 


® [aa oo a0 822 | 272 va] 
--- 


_ + —z tT 
2m, | ax? oy az? ax oy az3 


+VQ=EQ (4.45) 


where Q = 2.(x1, 1, 21, X2, ¥2, Zz) and E is the energy of the system. 
When the potential takes the form (4.44) this equation can be 
separated into two equations one of which represents the system trans- 
lational motion and the other the relative motion of the two particles. 
This separation is obtained by introducing six new co-ordinates, first of 
all the co-ordinates of the centre of mass of the system (x, y, Z) and 
secondly the co-ordinates of the second particle relative to the first. 


pu IAT axe pal tM2y2 oe Mazi toto 
my +m : my, +m) > my, +m, 
x = (x2 — x4), y=(¥2 - V1); z= (z2 — 24). 
(4.46) 


It is left to the reader to confirm that (4.45) becomes 


1 ag 872 372 1(e9 vO, ee) 
TIES A= tos t= —a to 
(m, tm) ay? az ax? ay? ae? 


= = [E — V(x, y, z)]Q (4.47) 
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where the ‘reduced mass’ of the system is 
mm 
=—_2 4.48 
age erean (4.48) 
It is now possible to separate this equation by writing 
Q(%, ¥, ZY, ZALE Y, DVR Y 2) (4.49) 


When this expression is substituted into (4.47) and the equation divided 
by the product fy it is found that the left hand side is independent of 
(x, y, z) and the right hand side is independent of (x, y, Z). Each of 
these parts must be equal to a separation constant which is conveniently 
taken as —2E;/h?. The resulting two equations are 


ay 97 27,2 


x2 re +532 urea +m,)E,f=9 (4.50) 
oy OY eee) Vv=0 (4512) 
axe ay? one ae me 


Equation (4.50) is simply Schrédinger’s equation for a particle of mass 
(m, +m) moving in field-free space. The parameter £ represents the 
translation energy of the system. Equation (4.5 1a) is the Schrodinger 
equation for a particle with mass equal to the reduced mass of the system 
moving in the potential V(x, y, z). The parameter Z, = (E — E;) represents 
the vibrational and rotational energy of the system. It is often useful to 
express (4.51a) in spherical polar co-ordinates. 


x =rsin @ cos ¢, y =rsin 6 sin 4, z=rcos@. 
Then 
1 a{,av 1 av 1 af. oy 
?? ar ( | sin? ag? sind 20\"" ° 26 
2 
+35 [Ey -— V]w =0. (4.51b) 


Consider now the case of the rigid rotator where the masses are kept 
at a fixed distance a apart and where V may be taken as zero. The 
Schrddinger equation (4.51b) then represents a mass moving over the 
surface of a sphere of radius a, i.e. 


1 ep 1 2a aw\ 2I 
——— + —— H cael Ps + —> E. = 0 4.52 
sin? @ 3g? sin @ 30 (in 2 Es 2) 


where J = pa?. The wave function depends only upon @ and ¢. This 
equation may be separated by writing 


¥(6, o) = O@)P@) (4.53) 
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and dividing by the product O®. 


_i @@, 1 af, @)\s Ae, 0. 
sin? @@ d¢?_— sin 6© dé 


This can be written as the two equations 


—> +m? = 4.54 
d@ do I m 

—> +— cot 0+ —E, —— @= 4.55 
a oe ee ae G2?) 


where m2 is the separation constant. The solution of (4.54) is 
® =A sin m@ +B cos md. (4.56) 


Since a wave function must be single-valued and continuous ®(@ + 27) = 
®(¢) and so m must be restricted to integer values. If the substitution 
a=cos @ is used in (4.55) then the ©-equation becomes 


2 2 
a a) S320 2-5 10-0 -l<a<l 
dow 7 7 (4.57) 
where 


= 2/E,/h?. 


This second-order equation has two solutions but in general these are 
both infinite at a = +1 (i.e. @ = 0, 7). Such a solution is not physically 
acceptable. But if takes one of the values 


A=K(K+1)  K=0,1,2,... (4.58) 
with 
K>\ml 


then one of the solutions is finite in the range —-1 <a < 1 while the other i: 
not. The equation (4.57) is Legendre’s associated equation (see Sneddon’s 
book) and the acceptable solution is the associated Legendre function 
Pi (a), i.e. 

@Q(0) = PR’ (cos 6). (4.59) 

So the physical restrictions on the wave functions require the energy 

to take the discrete values 

Be K(K + 1)h? 

7 21 


This simple model is useful in the theory of band structure of molecules. 


Ke0A ex (4.60) 
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The infra red spectrum of diatomic molecules 


The more rapid electron motion in a molecule can be separated from 
the nuclear motion of a molecule (Born-Oppenheimer approximation) 
and the molecular energy (apart from translational energy) can be con- 
sidered to be composed of three parts 


(a) the electron energy, 
(b) the vibrational energy of the component atoms, 
(c) the rotational energy of the molecule as a whole. 


Each electron energy level has a fine structure of vibrational energy 
levels and these in turn have a rotational fine structure. This section is 
concerned with the fine structure due to the atomic vibrational and 
molecular rotational levels. 

Classically, an accelerating electric charge produces an electromagnetic 
field. Molecular vibrations and rotations involve oscillations of charges 
and produce emission (absorption) of radiation. The frequency of this 
radiation is equal to the frequency of the corresponding molecular 
motion. For diatomic molecules the vibrational frequencies are in the 
infrared region ~10!3-10!4 Hz and the rotational frequencies are lower. 
Infrared and microwave spectroscopy of gases are powerful methods for 
helping to determine molecular structure. 

A diatomic molecule can be regarded as a dumb-bell rotator-oscillator. 
In the previous discussion of the rigid rotator the potential function 
V(r, 0, ¢) in (4.51) was taken to be zero. A simple model for diatomic 
molecules is to assume the simple harmonic potential 


V(r) =4k(@ — ro)? 


where ro is the equilibrium separation of the nuclei and k is a force 
constant. A detailed analysis of the rotator-oscillator (e.g. Pauling and 
Wilson, p. 263) shows that to second-order terms the energy levels of 
the system are given by 


h? n* 
Enxk =(nth)hw + K(K +1) 57 - KK + 1? TE 
n=0,1,2,... 
K=05452)54 


where w is the frequency of oscillation of the molecule and Tis the 
moment of inertia. The first term is due to the molecular vibrations 
(4.33) and the second term is due to the rotation of the molecule (4.60). 
The third term is a correction term arising from the centrifugal stretching 
of the non-rigid molecule during the rotation. The corresponding eigen- 
functions are 


VanK@, 6 ’ ) = Rnr@)PK (cos aye" 
and R,,(r) has the general form of (4.30). 
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Figure 4.6 The energy levels Ey, x for the non-rigid rotator-oscillator. 


The infrared spectrum observed depends upon the allowed energy 
transitions and these are determined by the selection rules. In Chapter 8 
it is shown using the dipole approximation that a transition of a charge 
from a state i to a state f is possible if the integral (8.55) 


[xfex: dr 


is non-zero where x; and x¥ are the initial and final state functions and r 
is the position of the charge. 

The properties of Hermite functions (Chapter 6, problem 14) restrict 
the allowed transitions for a harmonic oscillator to those between 
neighbouring energy levels so that the selection rule is 


bn=41. 


Consequently the emission or absorption spectrum of a harmonic 
oscillator should consist of one sharp line with a frequency equal to that 
of the oscillator. 

Analysis of the rotator-oscillator eigenfunctions shows that the 
selection rules are 


6n=0,+1,+2,... 6K =+1. 


The transitions with 6K = +1 form the R-branch and those with 6K =—1 
form the P-branch. The infra-red absorption spectrum (no electron 
transition) of heteronuclear molecules such as HCl conform to these 
rules. Each vibrational spectrum line has a fine structure (Fig. 4.7). 
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Note that the rule 5K =+1 means that there is no central line to the 
structure. 

There is no infra-red absorption spectrum for homonuclear diatomic 
molecules such as H> or O3. This is of course to be expected as in the 
vibrations of a homonuclear molecule there is no net movement of charge. 

It can be shown that when the electron motion is taken into account 
the above selection rules are correct on the assumption that the electrons 
have zero angular momentum about the nuclear-axis. (2 state.) This is 
the case for almost all diatomic molecules (NO is an exception) in the 
ground state. However, if the electrons do have a non-zero angular 
momentum about the nuclear axis then the transitions with 5K = 0 are 
allowed and form the Q-branch. 

Transitions involving a change of electron energy are very complex 
and in this case homonuclear molecules can show a vibration-rotation 
spectra. For further information the reader is referred to Introduction to 
Quantum Mechanics, by Pauling and Wilson, and The Wave Mechanics of 
Atoms, Molecules and Ions, by Schutte. 


Central field Coulomb potential 


In this final section of Chapter 4, an outline treatment is given for the 
hydrogen atom problem. The first part of the previous section is of 
importance. Consider the motion of a single electron of mass m ; and 
charge —e in the coulomb field of a nucleus of mass m2 and charge Ze. 
The potential energy of the system depends only upon the separation of 
the particles and is 

2 


Vin=- Ze 


4neor 


where €g is the permitting of free space. 
The analysis from (4.44) to (4.51) applies and the energy (excluding 
the translation energy) E, of the system satisfies 
ay 1 a7y 1 a{. .ay 
—_— a= 22° + - Ch — 8 —— 
? br ( ar P sin? @ O67 7 sind 26 (in 30 
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where r, 0, @ are the polar co-ordinates of the electron relative to the 
nucleus. As m2 > my,, the reduced mass yp is effectively equal to the 
electron mass. This equation may be separated by writing 


WO, 9, 6) =R(r)OO)H(4) : (4.62) 
and dividing by the triple produce RO®. The separated equations are 


rb 4, 

aime =0 (4.63) 
d@ do m} 

spe teat See lia+ 1) i le- 0 (4.64) 
1d/.dR)\ {2u Ze? 1G+0)\ p 

= — |r? —)4+( le, + ‘ 

r dr (- dr Fe = a)- r? = C8) 


where m} and /(/ + 1) are the separation constants (see rigid rotator). 

As ® must be single-valued and continuous the constant m, must be 
an integer. Equation (4.64) is the transformed Legendre’s associated 
equation and as the wave functions must be bounded in the range 
0<6 <7 the required solution is 


© = Pi" (cos 0) (4.66) 


with / a positive integer or zero and / > | m,|. 
The radial equation (4.65) may be solved by writing 


Qu uZe? 
2=_—-SE,, rae ee 4.67 
ss ‘a 6 4negh?a ed) 
and changing the independent variable to p = 2ar. Then 
WR 2dR 1 it 1), 8 
ar ae => R=0. 4.68 
dp? p do ‘| 4 p? ee) 


Observe that for bound states E, < 0 and a is real. The physical require- 
ments of single-valuedness and continuity have been imposed on the 
wave-function but no boundary conditions have been applied up to this 
point. These boundary conditions are 


(i) Y>Oasr>e — (ii) Wis finite asr > 0. (4.69) 


Equation (4.68) is discussed in books on special functions (¢.g. Sneddon), 
and solutions that satisfy R > 0 as p > and as p > 0 exist only if 8 is 
an integer n, such that n 21 + 1. The solutions are the Laguerre functions 


Ry(o)= Pp L2N(p) nn SI +1 (4.70) 
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where L241] is an associated Laguerre polynomial. The restriction on B 
results in a discrete energy spectrum 


E,= —Ze*p/2n?n?(4neo) n=1,2,.... (4.71) 


The quantum number n determines the energy and the degenerate wave- 
functions associated with this energy are 


Vnim@, 8, ¢) = Anim Ru (2ar)P{" (cos aye (4.72) 
1=0,1,..., (n —1) 
m,=-1, -(-1),....0,.-.41 


where Anim, is a normalization constant chosen so that 


f wy dr=1. 


all space 


The number of degenerate wave-functions belonging to the nth energy 
level is 


n—-1 
> (1+1)=n?. 
1=0 


The quantum numbers / and m, are related to the angular momentum of 
the system and the degeneracy with respect to m, is associated with the 
rotational symmetry of the atom and occurs in any central field problem 
(see Chapter 9). However, the degeneracy with respect to / is a property 
peculiar to the coulomb potential and is not readily explained in terms 
of the atomic symmetry (see Kemble, Quantum mechanics, p. 312). 

The ground state (lowest state) of the electron occurs when n = 1 and 
the single normalized wave function is 


Lf Z A h? 4ne 2Zr 

= = — [— —p/2 = 9 = 

W100 = (is) Vi (2) e aq jet 7) na 
(4.73) 


The next level corresponds to n = 2 and has four degenerate wave 
functions associated with it. 


_ _ 1 Z 3/2 pl 
W200 = (2s) = Jin (2) (2—p)e 


3/2 
2) pe°!2 cos 0 


_ bass 
W210 = (2pz) = 4/2n (2 


1 Z 3/2 . 
Woie1 = Z| pe Pl? sin Be*!9, (4.74) 
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From the last two it is possible to construct the functions 


(2px) 1 zZ \32 a cos } 
= —j} pe°l sin 6 (4.75) 
(py) 4v2n fe ane 


The formula (4.71) gives the basic structure of the atomic line 
spectrum but it does not give the fine structure. This is essentially a 
relativistic phenomenon and is connected with the concept of electron 
spin. 

Positive values of the total energy correspond to unbound states in 
which the electron kinetic energy is greater than the potential energy. 
The energy spectrum is then continuous and the wave functions are not 
normalizable. 


PROBLEMS 


1 Consider the one-dimensional Schrédinger equation (4.15). If w, 
Wz are two different eigenfunctions corresponding to the same value of 
the energy show that 


d?y d7y 
Gat V2 ge 4170. 


Integrate the equation to obtain 
dy; 


d 
age? -—Wy ae constant. 


If the spectrum is discrete both y, and W vanish at infinity. By inte- 
grating a second time show that y, and W2 are linearly dependent. This 
result shows that in a one-dimensional problem the energy levels of a 
discrete spectrum are not degenerate. 


2 Suppose the potential in a one-dimensional Schrédinger equation is 
symmetric, i.e. 
V(—x) = V(x). 


By carrying out the change of variable x -> —x show that in the non- 
degenerate case the eigenfunctions are either odd or even, i.e. 


W(—x) = FY). 


3 Carry out the analysis of the tunnel effect in the case FE > Vg (4.22). 
Show that contrary to classical predictions some particles are reflected. 


4 With the notation of (4.26) show that the frequency of a classical 
oscillator is w = (k/m)!/2. From energy considerations show that the 
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maximum displacement of the particle is x9 = (2E/k)'/? where £ is the 
total energy of the oscillation. 

Define P(x) dx to be the probability that the particle is in the element 
dx. This probability must be proportional to time d¢ spent in dx, Using the 
harmonic solution x =X sin wt together with the relation dt = dx/x show 
that a 


[= x 1/2° 
a 


5 The potential in a three-dimensional problem is defined by 
V=-Vo r<a, V=0 r>a Vo > 0. 
(r is a spherical polar co-ordinate.) 
Show that the spherically symmetric bound solutions (E <0) which 

satisfy the boundary conditions 

(i) w is finite at r=0 

(ii) y and dy/dr are continuous at r =a. 
ii) y>Oasr>% 


P(x) dx = 


(See Fig. 4.4.) 


are of the form 


—or = 
=—e r>a a= 
r nh? 


2 
y= sin Br r<a B= 57 (Vo +B). 


(m is the electron mass.) 
Show that a necessary condition for a non-trivial solution is 


aa = —$a cot (Ba). 
aand 8 must also satisfy. 
2 
(aa)? + (fa)? = 57 Von. 
Illustrate these conditions graphically. 
Explain why bound solutions exist only if 
2m Vo T 


Pe) a>,- 


(The spherically symmetric Laplacian is 
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6 The potential in a one-dimensional problem is defined by 
=—Vo |x| <a, V=0 |x| >a, Vo >0. 


The correct boundary conditions on the eigenfunctions are 
(i) w.> 0 as |x| > % (Bound states). 
(ii) w and dy/dx are continuous at |x] =a. 


Solve the time-independent Schrédinger equation. 
Show that if 


2m|E| 
h? 


a>1 


then the simplified boundary conditions p = 0 at |x| =a may be used. 
Verify that the eigenfunctions are either odd or even. 


7 The potential inside a cylinder of radius a and length / is —-Vg(Vo > 0) 
and is zero outside. 

The co-ordinate axes may be chosen so that the z-axis coincides with 
that of the cylinder with the origin at the centre of the lower plane end. 
Schrédinger’s equation in cylindrical polars (p, ¢, z) is 


n?[1 9 a\ 1 3? 8? 
—-—|-—|p-—ltoosta lv +E]y= 
2m F 0p (55) p* ag? az? age La a 
p<a 0<z<l1 0<@<2n. 


For |£| sufficiently large the boundary conditions reduce to y = 0 on 
the cylinder boundary. Use the method of separation of variables to 
show that the solutions finite and single-valued within the cylinder are 
of the form 


n 


NZ cosr@o 
=A sin J (a 
¥ 1 (ap) sinr¢ 
where n is a positive integer, r is a positive integer or zero and J, (aa) = 0. 
Find the corresponding eigenvalues. (The Bessel function J, (x) is a 
solution of 


dy? 1 dy r? 
sazt- til -——s]y =0. 
dx? x dx x? ad 


8 Suppose two interacting particles have masses m, and my and are at 
the positions (x1, 1,21) and (x2, ¥2, 22) respectively, and suppose that 
the potential energy is a function of the distance between the particles. 
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The classical Lagrangian is 


m E a . m2,. . » 
Lat EH HED tO HI DV 


Carry out the change of variables (4.46). Find the momenta conjugate to 
the six new co-ordinates and derive the classical Hamiltonian 


1 


1 P3 Pe 
H=————_ P2 + Pr + 2) 4 — p+ + —— + 
2m ay erty te) Qu | Ee vo) 


r r? sin? 0 


where p is the reduced mass (4.48). 


9 The classical Hamiltonian for a rigid rotator is (see Chapter 1). 


1 1 
H@6, ¢, Po, Py) = oF (x + ag 3) 


where J is the moment of inertia. An alternative way of writing this is 


1 
H=—— : 2 4 p27, 
dTant 9 (in Po)" + Pol 


Carry out the quantum substitutions 


ee 


Which one of the above Hamiltonians gives the correct Hamiltonian 
operator used in (4.52)? Verify that the correct operator is Hermitian. 
Comment on the results of this question. 


10 In the solution of the simple harmonic oscillator the wave function 
can be written in the form (4.30) where H(y) satisfies Hermite’s 
differential equation 


H" — 2yH' + (y¥ —1)H=0. 


Use the method of solution in series to obtain the first three terms of 
the two linearly independent solutions. 

The physically acceptable solution must be composed of a finite 
number of terms. Find this solution and explain why 


y=1+2n, n=0,1,2,... 


if y” is to be the highest power. in the solution. 
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CHAPTER 5 
Quantum Mechanics I] 


5.1 Complete Description of a State 


It has been mentioned that in a quantum mechanical system not all 
observables are simultaneously measurable. The x-co-ordinate of a 
particle and the x-component of its linear momentum cannot both be 
known exactly at the same instant. However there do exist sets of 
observables that can be measured simultaneously. For example it is 
possible to know simultaneously the energy and the total angular 
momentum of a particle in a central field. 

The state of a system is known as completely as possible when the 
values of all the observables that can be simultaneously measured are 
known. 


Compatibility 


If two observables A and B are simultaneously measurable in a particular 
state then if either is measured, an unique result is obtained. This implies 
that the state function is an eigenfunction of the representative operators 
a and 8, (Postulates four and five). 

The observables A and B are said to be compatible if there exists a 
complete set of state functions, each of which is an eigenfunction of the 
corresponding operators. 

The following two theorems illustrate the connection between com- 
patible observables and commuting operators and lead naturally to a 
discussion of degeneracy. 


Theorem I 


If the operators a and 8 have a complete set of simultaneous eigen- 
functions then the operators commute. 
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Proof 


Let w; be one of the complete set of eigenfunctions of the operators a 
and 6 with corresponding eigenvalues a; and b; 


oy, =a; By; = bij. (5.1) 


Since the set of simultaneous eigenfunctions {w;} is complete an arbitrary 
wave-function can be expanded 


7] =D cf Wj. (5.2) 
i 
Consider the effect of the commutator of — Ba on y. 
(a8 — Ba)W = (a8 — Ba) Dc Wj. (5.3) 
j 
From (5.1) this can be written 
(08 — Ba) = > cj (bja; — a;b;)¥; 
j 
=0, (5 4) 
Since yp is arbitrary then 


(06 — fa) = 0. (5.5) 


The commutator representing the co-ordinate q; and its conjugate 
momentum p; is 


ha 
is >| =ih 5.6 
E i =| u (5.6) 


and clearly q; and p; are not compatible. 


Theorem 2 


If the operators a and 8 commute, there exists a complete set of functions 
which are simultaneously the eigenfunctions for both a and 8. This 
implies that the observables A and B are compatible and this theorem is 
the converse of theorem one. 


Proof 


Consider first the case when the operator a has non-degenerate eigen- 
values. It will be shown that every eigenfunction of a is also an eigen- 
function of 8 if « and B commute. Let w; be one of the complete set of 
non-degenerate eigenfunctions for the operator a, 


a; = aj. (5.7) 


QUANTUM MECHANICS II 103 


Operate on this equation with B and as a and 6B commute 
Bay; = a;(BY;), 
a(B;) = a; BY;). (5.8) 


The function Sy; is an eigenfunction of a with eigenvalue a;. As the 
eigenstates of a are by assumption non-degenerate then clearly 


By; = (constant) p;. (5.9) 


This implies that y; is an eigenfunction of 8 and the result is proved. 

Suppose now that a has degenerate eigenvalues. Let W; be one of the 
n linearly independent eigenfunctions spanning the n-dimensional sub- 
space associated with the degenerate eigenvalue a. 


ay; =ay; i= 1, ee vg lle (5.10) 


Let this sub-space be V(a). Any eigenfunction of a with eigenvalue a 
must lie in this sub-space and vice-versa. Operate on (5.10) with 8 and 
as a and 8 commute 


a(By;)=a(By;) i=1,..40 (5.11) 
(BW;) is an eigenfunction of & with eigenvalue a and must belong to V@). 
n 
BY; = 2 aij Wj. (5.12) 
i= 


That is, V(a) is ‘invariant’ under § and this is true for each eigenvalue 
sub-space of a. 

Since the eigenfunctions of a form a complete set then any eigen- 
function of 8, with eigenvalue b say, can be expanded 


n 
2 ki to (5.13) 
i= 
where the sum represents a function in V(@) and ¢ isa linear combination 


of the eigenfunctions of « outside V(@). Since each eigenvalue sub-space 
of a is invariant under B then the linearly independent functions 


x= 3 ki (5.14) 
and ¢ are each eigenfunctions of 8. 

Bo = bo (5.15) 

Bx = bx. (5.16) 


From (5.14) x is simultaneously an eigenfunction of a with 


ax =ax. (5.17) 
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The function ¢ can be further decomposed into its linearly indepen- 
dent components in the other eigenvalue sub-spaces of a, although some 
of these components could be zero. So each of the eigenfunctions of 8 
can be constructed from a single eigenvalue sub-space of a. Since Bis an 
operator representing an observable its eigenfunctions form a complete 
set and the theorem is proved. 

In particular there must be n simultaneous eigenfunctions of a and 8 
that span V(@). They all correspond to the eigenvalue a of & but in 
general will belong to different eigenvalues of 6. 

As an illustration, consider the operators representing the three 
cartesian components of linear momentum of a particle. 


=, = == (5.18) 


These operators commute with one another and by theorem two there 
must be a complete set of functions which are simultaneously eigen- 
functions of all three. The eigenfunctions are 


; Px 
1 
W(x, y, z) = exp j (PxX * Pyy + Dz2) —e<py< (5.19) 
Pz 


where px, Py, Pz are the eigenvalues. 


Degeneracy 


For a free particle the wave-function (5.19) satisfies Schrédinger’s time- 
independent equation and is a possible state function. The values of the 
momentum components px, Py, Pz determine the wave-function, apart 
from the usual arbitrary wave factor, and the three compatible momen- 
tum observables form what is called a complete set. 

The commutation of operators is closely related to the occurrence of 
degeneracies. In proving theorem two it was shown that from the set of 
eigenfunctions W1,.. ., W, of « which span the n-dimensional sub-space 
of the degenerate eigenvalue a, it is possible to construct 7 linearly 
independent functions 


xis oy Ky 7 ly een (5.20) [(5.14)] 


which are simultaneously eigenfunctions of a and 8 if the commutator 
[a, B] is zero. 


ax! =ax/ (5.21){(5.17)] 
Bx! = bly! (5.22){(5.16)] 


The superscript j is introduced to distinguish the n-eigenvalues b’. 
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As the operators a, 8 commute, the associated observables A and B 
are compatible and may be measured simultaneously. Tf on such a 
measurement the values a, b/ are obtained then immediately after, the 
state function is essentially x/. If the eigenvalue b/ is non-degenerate the 
wave-function is determined completely. On the other hand, if b/ is, say, 
m-fold degenerate (m <n)x/ must be a function in the associated m- 
dimensional sub-space. In this case, experience shows that another 
observable C may be found whose operator ‘y commutes with both a and 
B and so may also be measured. After this measurement is carried out the 
wave-function is specified more completely. In this way the dimensionality 
of the sub-space, in which the wave-function is known to be, is reduced. 
More compatible observables may be found and measured until the wave- 
function is specified completely. 

Eventually a ‘complete set’ of commuting observables is found so that 
when the eigenvalues of each are specified, the wave-function is known 
completely, apart from a phase factor. No other observable can be 
measured unless it is a function of those already in the set. 

For a given system there may be several distinct complete sets of 
observables. All the members of one such set do not commute with all 
the other members of any other set although they may have some 
members in common. 

There is a very interesting alternative but equivalent way of discussing 
degeneracies. This involves a study of the symmetry properties of the 
system and is dealt with in Chapter 9. 


Angular momentum quantum numbers 


It was shown in section 4.3 that the motion of an electron in the field of 
a nucleus is essentially described by the Schrédinger equation 


2 
2 mi Vu + VW HEY, (5.23)((4.61)] 


where the nucleus is at the origin, u is the reduced electron mass and E is 
the internal energy of the system. The potential of interaction is spheri- 
cally symmetric and is a function of r only. It will be shown below that 
in this central field problem three observables form a complete set. 

Classically, the angular momentum about the origin of a particle with 
linear momentum p at the position r is 


L=rAp. (5.24) 
The z-component of this orbital angular momentum is 


L, =XPy — YPx- (5.25) 
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The quantum mechanical operator #, representing this observable is 
obtained using the substitutions (3.8). 


BoP Oe 20. 
F= P ( ay ees | (5.26) 


(As there are no non-commuting factors in (5.26) there is no ambiguity.) 
Similarly, operators may be obtained for the other components. 


ala 2 
$,= = {yo 2 
ay (2 x) 

_h{ a a 


It is easily shown that the operators (5.26), (5.27), do not commute with 
one another and so the observables L,, Ly, L, are not compatible 
(Theorem 1). For example 


h \? 7) 3 7) r) 

2.8 92(?)[bx-75) Fae) 
AY Meee 
~\7 ax * a2 Yaz 7 oy 


a a 
a 2 — _ 
=—h (2 x2) 


i.e. 

P2yLy —- Ly Ly, =ih L,. (5.28) 
Similarly, 

Ly L,-— £,L£,=in£, 

L, Ly, —- Ly $,2=ihZL,. (5.29) 


However, all three angular momentum operators do commute with 
the Hamiltonian of (5.23). In particular, if all derivatives concerned are 
continuous, 


L,V? -—- V? $,=0 (5.30) 


and 
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ie. 
£,V(r) — Vir) £7, = 9. (5.31) 
As the Hamiltonian operator is 


nr? r 
H=—-— VA +V 
2u © 
then 


LH — HL, =0. (5.32) 


From theorem two the sets of observables (H, Lx), (H, Ly), H, Lz) are 
compatible. 
The magnitude of the total angular momentum is 


1? =£2+13 +22 (5.33) 
and the operator representing this observable is 
L=L2+ G2 +F}. (5.34) 


As £,, Ly, L, all commute with H# then so does F?, e.g. 
LEH = LA HL,) = HL: 
L2H ~HL*=0. (5.35) 
Similarly Px, Ly, Lz all commute with 2 2 eg. 
LiL? = Lr Let LiL + LL; 
= (nL, + Le L,) Ly + (—in Ly + Ly Lz) Ly+ Li 
= th( Ly Ly — Ly Ly)+ LAL FL, +in Ly) 
+ Ly(Ly Lr —ihL,)+ L? 
=L22,+032L,+£} 
1.¢€. 
L,L? — L* L,=0. (5.36) 
Clearly, then the sets of observables 
H, L’, Ly @ 
H, L?, Ly (ii) 
H, L’, Lz (iii) (5.37) 


are compatible and in fact each is complete. 
As in section 4.3 it is useful to work in spherical polar co-ordinates 
defined by 


x =rsin @ cos ¢ y =rsin@ sind z=rcos 6. (5.38) 
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The operators (5.26), (5.27) and (5.34) become | 


_ Af. ,2@ r) 
L£,=- ; (sno X +cat cos63) 


h 3 ro) 
2-4 (-coso Z+coto sno?) 


ag 
ha 
te 45 (5.39) 
1 3a a 1 2? 
Ff? =_p7? | —— — [sin 9d — | +—>— —3}. : 
f fe ry (in 955] sin? 0 =| G40) 


The eigenfunctions of the Hamiltonian for central field problems are 
essentially 


Vnim (ts 9 $) = Rn, (P)Pi"! (cos 0) ec", (5.41) [(4.72)] 


(In the special case of a Coulomb potential the radial part may be 
identified as a Laguerre function (4.70), and the energy depends only 
upon n). 

Clearly these energy eigenfunctions are also eigenfunctions of the z- 
momentum operator Y, with eigenvalues hm;. m, is called the magnetic 
quantum number. Similarly by comparing (5.40) with (4.64) it is seen 
that Vpzm, is an eigenfunction of the total angular momentum operator 
£7? with eigenvalue h?1(I + 1). 1 is called the angular momentum number. 
Measurement of the observables L? and L, determines the angular part 
of the wave-function and measurement of the energy determines the 
radial part. 

Observe that Wnim; is not an eigenfunction of the operators F,, Ly 
except in the case that the quantum number / = 0. Then the wave-function 
depends upon r only and Y,, Ly, £, have the common eigenvalue zero. 
(Note that the commutation rules (5.28), (5.29) are satisfied.) 


§.2 Constants of the Motion 


It is well known that there are important conservation laws in classical 
mechanics. The rate of change of a classical variable F(q1,.-., n> 


P1,---DPn, t) is given by 
a ee (1.34) 


where {F, H} is the Poisson bracket of F and the Hamiltonian. When H 
does not depend explicitly on the time dH/dt = 0 and the Hamiltonian is 
a constant of the motion. Any time independent variable of the system 
whose Poisson bracket with the Hamiltonian vanishes, is conserved. 


| 
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Another example is the conservation of angular momentum of a particle 
moving in a central force field. The Hamiltonian is 


1 
es (p2 +p? +p7)+ V0) (5.42) 


where the potential energy is spherical symmetric when the origin is 
suitably chosen. The z-component of angular momentum is 


L, =xPy — YPx 
and it is a simple matter to show that the Poisson bracket 
{L,, H}=0, (5.43) 


and so L, is a constant of the motion. A similar argument applies for Ly 
and Ly and consequently the vector angular momentum is conserved, 

There are analogous results in quantum mechanics. In a given state 
described by a normalized wave-function V(q1,..-n: t)a general 
observable does not have a definite value. The expectation value of the 
observable, A is given by 


(A)= f Va dr (3.21) 


where a is the representative operator. This expectation value may not 
be constant in time and the rate at which it changes is given by 


d da ove ow 

—(A)=| | o* = v + — ob + ba — [dr. : 

7) | Se + ad + ta lar (5.44) 
The state function must satisfy Schrddinger’s time-dependent equation. 

ow i ov* i 

—=- HY — =tH * 

a A yi A (#Y) (5.45){G.10)] 


and as the Hamiltonian operator is Hermitian 
{ CA wy (ew) ar = { veoto ar. (5.46) 
Substitution of (5.45) and (5.46) into (5.44) gives 
d da i 
—(A)= | vt) —+— = ; : 
6A) Jur[ Bek te aan} war (5.47) 


This equation is the quantum analogue of (1.34) and may be used to 
define an operator representing the time dervative of A, 


da i 
oF + ry (Ha — aH). (5.48) 
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This illustrates a point already mentioned (3.6) that in-the transition 
from classical mechanics to quantum mechanics Poisson brackets are 
replaced by commutators. 

If the operator a does not depend explicitly on the time and if it also 
commutes with the Hamiltonian of the system 


d 
a {(A)=0 


and the expectation value of the observable A is a constant of the 
motion, A is said to conserved. In particular, if the state function is also 
an eigenfunction of a, then A has the definite value equal to the eigen- 
value in that state. The state function is an eigenfunction of any given 
representative operator immediately after a measurement of the appro- 
priate observable. However the state function will remain an eigen- 
function only if the observable is conserved and the associated quantum 
numbers are called ‘good quantum numbers’. 

For most systems the Hamiltonian does not depend explicitly on the 
time and its expectation value is a constant. 

If the state function is an eigenfunction of the Hamiltonian operator 
at some instant it will remain so. Such a state is called a stationary state 
and the wave-function has the form 


V@, +9 On t) = v@1, ee-69 dn)e tin (5.49) 


It is always possible to measure conserved observables simultaneously 
with the Hamiltonian. In the central field problem the observables (H, 
L?, L,) form a complete set with values which are constants of the 
motion. The quantum numbers 2, J, m; are good quantum numbers. 


5.3. Uncertainty principle 


Consider two observables A and B with representative operators a and 8. 
If « and 8 commute both observables may be measured simultaneously. 
If the operators a, 8 do not commute they do not have a complete set of 
simultaneous eigenfunctions, (Theorem one). Define y by 


(08 — Ba) =iy. (5.50) 
There are two possible cases. 


(i) If y is a Hermitian operator it may be possible to measure A and B 
simultaneously in certain states. If y; is a simultaneous eigenfunction 
of a and 6 then it must also be an eigenfunction of y with zero eigen- 
value. Even though there may be an infinite number of such eigen- 
functions they cannot form a complete set. 

(ii) If y is a real constant then 


yv; =0 
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has no non-trivial solutions and a, 6 do not have any eigenfunctions 
in common. A well-known case is that of the position co-ordinate 
and conjugate momentum 


iP; — Bias) = ih. 
The position and momentum of a particle can never be known 


simultaneously. Any measurement of the position must disturb 
the momentum and vice-versa. 


Even so, it is possible to obtain a relation between the uncertainties in 
the measurements. 


Schwarz inequality 


If y, and W2 are two quadratically integrable functions then 


[Jvtvs ar][f vive ar] >| [dvi ar” (6.51) 
This is analogous to the result 

|AI?(BI? >| A. Bi? 
where A, B are vectors in three-dimensional Euclidean space. Hilbert 


space is a unitary space and the integral f WW, dr represents the 
‘scalar’ product of y3 and Wy. 


Proof 


Let A be a real constant defined such that the function x2 = AW is 
normalized to unity. 


A? =1/{ bal? ar. 
Consider the function 
&=0,-xX2 [xdvs dr. (5.52) 
Since f |®[? dr >0 then 
five dr —2|[x2v dr|? + (xtvi ar)? >0 
1,.€. 
f wi ar> | fxtvi ar’. 
Substitute for x2 to obtain 
[iil ar [ival® ar> | fut, ar]. (5.53) 


| 
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Uncertainty relation 


When a system is in a state described by the normalized function wy the 
uncertainty (AA) is a measurement of the observable A is defined by 


(a4)? = | y* fo —<A)]?y dr (5.54) 
where the expectation value of A is defined as usual by 
(A= | vray dr 


(5.54) may be expressed in another form by expanding yw in terms of the 
eigenfunctions of a. Writing 


w=DZev; with ab; =a;y; 
j 
(AA)? = & lejl? fa; — (A)? 


(4A)? is the variance of the possible results of measuring A. 
In the Schwarz inequality write 


W, =[a-—(A]y and V2 =([B-—(B]v 
where § is the operator representing the observable B, The integral 


[iva ar=f (@- (Av) *[@— Ay] ar 
= [ v*(@— (Ay ar 
since (a — (A)) is Hermitian, i.e. 
fiviP dr =(AA)?. 
Similarly 
flvol? ar = (az) 


Since the square of the modulus of a complex quantity is greater or 
equal to the square of its imaginary part then 


| [viva arf’ >4| futur ar— J vtve ae/? 
>4|{ v*(os— bayy ar]. 
The Schwarz inequality (5.51) gives 
(a4)?(0B? >4| { v*(a6 — Bay ar|”. (5.55) 
If the state function is an eigenfunction of both a and £ then 


(A.A)?(AB)? =0 
as expected. 
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In the more general case when [a, 8] =i7 


(4A)(AB)>4| J vey ar. (5.56) 


This is a general statement of the Heisenberg uncertainty relation. In 
particular, the uncertainty relation for the momentum and position of a 
particle is 


(Aqi)(Api) *, (5.57) 


This confirms a point already mentioned in Chapter 4 that when ? can be 
neglected, classical mechanics can be used. 


- 
oO 
-_— 
- 
~-—— 


wanes 


“Tncident H 


plane wave 


Figure 5.1 Illustration of the uncertainty principle. 


There are several simple experiments which illustrate the uncertainty 
principle. Suppose a particle with a known momentum is incident 
| normal to a screen which contains a single slit. The wave-function of the 
| particle is the plane wave 


y = eikx 
and the momentum is p,, =k. (The co-ordinate axes are shown in (5.2).) 
Before reaching the screen, the position of the particle is completely 
uncertain but the component of momentum parallel to the screen is 


| known to be zero. After passing through the slit the y-co-ordinate is 
known with an uncertainty essentially equal to the slit width a, ie. 


Ay =a. 


114 A FIRST COURSE IN QUANTUM MECHANICS | 
| 
In discovering this information about the particle position an element of 


uncertainty is introduced into the conjugate component of momentum 
Dy. After passing through the slit the ‘particle wave’ undergoes diffrac- 
tion and there is a finite probability for the particle to be deflected 
through an angle AQ. It is well known that the first minimum in the 
diffraction pattern occurs when 


d is the ‘wave-length’ of the particle wave and is equal to h/p. A 
measure of the uncertainty in the momentum is clearly 


Ap, = pAé 
h 


a 
AyApy ~h. (5.58) 


Note that if the slit is made smaller so that the position is known with 
greater accuracy there is a corresponding increase in the uncertainty of 
the momentum. 

The lowest energy level for a system is consistent with the uncertainty 
principle. For example, consider the particle confined in a cube of side 
a. The position uncertainty Ax ~ a implies a momentum uncertainty 
Ap, ~ h/a which in turn implies a minimum energy ~h?/ma?. This 
should be compared with (4.43) where the lowest non-trivial-state 
corresponds to/ =1,m =n =0. 

As interesting applications consider the relative energies of an atomic 
electron and a nuclear a-particle. For the atomic electron a sensible 
characteristic length is 1 A. If Ax ~ 107!° m then Ap, ~ 10° # and 
this implies an energy ~10 eV. For the dimension of the nucleus take 
Ax ~ 1074 m then Ap, ~ 1014 h and this implies an energy ~1 MeV. 
Atomic electrons do indeed have energies sensibly measured in terms of 
eV and the kinetic energy of emitted a-particles is of the order of MeV. 


5.4 Wave Packets 


A ‘wave packet’ is a wave-function that attempts to give an almost 
classical description to a particle so that it is possible to give some 
meaning even if uncertain, to the momentum and position simultaneously. 
The one-dimensional wave-function B exp [i(kx — wt)] represents a 
particle with momentum hk and energy hw. The momentum eigenvalues 
form a continuous spectrum and a superposition of them is represented 
by an integral. 


V(x, )= f Bk) &&*—&) ak (5.59) 
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| 
|B(k)|? dk represents the probability that a measurement of the 
momentum will give a value such that 


dk dk 
es eee +— |]. 
n(t pak ) 


If the wave-function W(x, t) is to represent a particle with a characteristic 
momentum it is necessary that B(k) be vanishingly small except for a 
small range of values of k. 

Suppose B(k) has an appreciable value only near k = Ko. The energy 
and hence the ‘frequency’ w are functions of k and may be expanded in 
Taylor series about Ko 


dw 1 dw 
Clk) = Wo + (k — ho) Ge t7 Eko) Gaz k~ko 
where 
dw (dw d2w (d?w 
= w(ko) =| 5p c2=(<S) . 6.60 
We = wl o> ake (22), and aK (2), (5.60) 


The wave-functions becomes 


W(x, t) = ellkox Hot) j B(k) exp ~ ko) (» as 7 
Bs dko 
Lg k pie, dk (5.61) 

ey OR ake : : 


If w(k) is a linear function d?w/dk2 = 0 and the wave packet does not 
disperse but travels without change of shape. In this case defining 


f@)= j B(k) ef &—*o)™ dk (5.62) 
then 
W(x, t.)= flx— GO | Gihox~ O08), (5.63) 
dko 
In particular 


W(x, 0) = f(x) ef*o* 
Since B(k) is appreciable only in the neighbourhood of k = ko, f(x) 
must be a relatively slowly varying function of x and is non-zero in the 


neighbourhood of x = 0. Equation (5.63) then represents a plane wave 
with a modulated amplitude 


dw 
re S 
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and the ‘wave packet’ moves with a group velocity 
Ve.=—. (5.64) 


For a free particle 
a?k? 


dw hkg po 
d —_ FE. 
2m aneso dkg m am 


The packet velocity is equal to the velocity corresponding to the mean 
value of the wave number Ky. 

The wave packet description makes it possible to associate a mean 
position and momentum with a particle but the finite size of the packet 
makes it impossible to specify the position exactly and the spread in the 
k-values needed to construct the packet make it impossible to specify 
the momentum exactly. This is in accordance with the uncertainty 
principle. 

The above treatment is readily extended to three dimensions and the 
wave packet is then 


Ur, 1) = f ff B(k) ef(#-t— #8) dk (5.65) 


with an associated velocity 


Vy =grady w(k). (5.66) 


Ehrenfest’s theorem 


In 1927 Ehrenfest showed that the motion of a wave packet obeys 
Newton’s laws. He showed that if (x) and ( p) are the average values of 
the position and momentum for the wave packet then 


d 
mo 6x) = (Px) (5.67) 


and if Vis the potential energy of the particle 


de ya (OY 
(pa) -(), (5.68) 


Proof 


Let W(r, £) be the state function describing the motion of a particle. 
Assume it is quadratically integrable and normalized to unity. The 
expectation viue of the x-co-ordinate is defined by 


(x)= [ Weew dr (5.69) 
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and the average value of the x-component of the velocity is (5.48) 
i 


, fvttoex ~ xH\WY dr. 


d 
a {x)= 
The Hamiltonian operator 


hn? 
H=-—V*+VO®) 
2m 


and so 
d ih 
Sem = * 2 _ *. 2 
ae? am {ov (V2x¥) — U*x VOW] dr 
ih 2 2 
=_ —— |r] x V20 +2 — —xV' | dr 
m 
1 [ys (? ay 
m i ax 
ie. 
d 1 
— (x) =— . 5.70 
ae mex (5.70) 


The rate of change of the expectation value of the x-component of 
momentum is 


d 3 ) 
— = * —_— — wy 
<(pd=|¥ cae 2 2 dr 


3 
=| u (-2} dr (5.71) 
ox 
i.e. 
d av 
57 (Pe x) (5.72) 


For a sharply defined wave packet (x) and ¢ px) can be associated with 
the classical meaning of position and momentum of the particle and 
— (8V/ax) can be identified with the classical force component. The 
packet then behaves like a classical particle. 
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5.5 Probability Current Density | 


In the previous section it has been shown that the operator representing 
the velocity is the momentum operator divided by the mass, i.e. Hi/im) 
grad (5.70). This operator can be used to find the expectation value of 
the velocity for a given state function. 

Arelated quantity is the probability current vector § which can be 
used to find the probability that a particle will cross a given surface in 
unit time. Suppose (r, t) is the state function representing a one-particle 
system. The probability that the particle is in the finite volume QQ is 


p(Q)= i) Wh dr. (5.73) 
2 
The rate of change of this probability is 
dp(Q) = ~ 
a Jar W +r dr (5.74) 
2 


and as W satisfies Schrédinger’s time-dependent equation 


dp ve =( [VOPY)* — v*(QPW)] dr. 


The Hamiltonian operator is 


hh? 
H=-—V4+v 
2m 


and so 
a2 
U(HV)* — UX) = — om [WV20* — PF V2 WH]. 
Using the vector identity 
div [Y grad W* — W* grad W] = WV? Yt — vFV? DY 
the rate of change of p(Q) is 
Q 
se). _ - tn om {av [W (grad W*) — W* grad Y] dr. 
A vector function of position S may be defined by 
= jh [WY grad &* — W* grad ¥] (5.75) 
2m 
and then 


se) _ _ | div ar. (5.76) 
a 
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The volume integral can be transformed to a surface integral by Gauss’s 
theorem, ie. 


dp(Q) _ §s.da (5.77) 


where A is the surface enclosing the volume Q and the direction of dA 
is along the outward normal. Clearly from (5.77), the integral of S over 
the surface A is the probability that the particle will cross the surface 
going outwards in unit time. S is the probability current density. From 
(5.74) and (5.76) 


a 
57 CY*W) t div S=0. (5.78) 


This equation is analogous to the equations of continuity of hydro- 
dynamics and electrodynamics. Observe (5.75) that if ¥ is a real wave- 
function, the probability current density is zero. 

Consider the energy eigenfunctions for a plane wave moving in the 
x-direction. The wave-function 


W(x, y, 2) = VN el 


represents a density of N particles per unit volume. The probability 
current density vector has a non-zero component only in the x-direction 
and 


This is the expected result as the particles have momentum Wk and this 
expression gives the number of particles per second crossing unit area 
perpendicular to the x-axis. 

Amore interesting example is obtained by considering the central 
field eigenfunctions. From (5.41) these have the form 


wer, 8, 6) = FOr, 0) 


where f(r, 6) is a real function of 7 and 6 such that W is normalized to 
unity. The vector S has a non-zero component only in the ¢-direction 
ie. in the direction 0r/0@ about the z-axis. A simple calculation show 
that 


Sy = WP (5.79) 
br sin 6 


where is the reduced electron mass. The spherically symmetric s-states 
have zero current density. For the p, d states etc. equation (5.79) 
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represents an electric current circulating about the z-axis and it can be 
shown (Slater, p. 163) that the associated magnetic moment is 


ZT mM. (5.80) 


PROBLEMS 


1 A function of the operator a can be defined by 
f@= > kyo". 
n=0 


Show that the eigenfunctions of a are also eigenfunctions of f(a) and 
express the eigenvalue of the function operator in terms of the eigenvalues 
of a. 

Explain why if a is Hermitian then f(a) is also Hermitian. 

If the commutator of the two operators a, 6 is zero show that the 
commutator [f (a), 8] is zero also. 


2 The angular momentum about the origin of a particle with linear 
momentum p is given by 
L=rAp. 


By differentiating L with respect to time show that in the case of a 
central field when the force is directed towards the origin that L is a 
constant of the motion. 


3 Inthe case of a system of two particles the operator representing the 
x-component of the angular momentum about the origin is 


and similarly for &, and Ly (5.26), (5.27). 
If the masses of the particles are m,, mm the x-co-ordinate of the 
centre of mass is 


e. m,X, +m2X2 
my, +m 


with similar expressions for y and z (4.46). The co-ordinates of the 
second particle relative to the first are 


a =(x2 — x), B=(y2 —Y)) Y =@2 - 21). 


| 
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Verify that the given expression for -¥, can be transformed to give 


h ) 3 7) ) 
=— ——y—|tla-——b-]I.- 
%, i ( ay y 3.) ( ag 8 2)| 
Explain the significance of the two terms in the angular momentum. 


4 Show that if wis an operator and ¥, and W are two quadratically 
integrable state functions then 


d * = * aa i 
dt | [eters | bas {vi E + h (fa )| VY, dr. 


(Compare equation (5.47).) 


5 Show that if and @ are two Hermitian operators and , and W2 
are two quadratically integrable wave-functions then 


f vE(eBy2) ar = | Gays)*vo dr. 
Hence show that 
{ VF [08 — Ba] v2 ar = | [Go —08)¥ 1] *¥2 ar. 
Deduce that if an operator y is defined by 
(08 — Ba)=iy 
then y is Hermitian (5.50). 
6 It can be shown that for a linear harmonic oscillator the expectation 


value of the potential energy is equal to the expectation value of the 
kinetic energy. In particular, for the lowest energy state 


eH eT = 


As the expectation values of x and p are clearly both zero then the 
uncertainties in x and p are respectively given by 


ad’)? 
2x 2 2= slp 
(ox) =fytx?wdr, (Ap) Jue (] vdr. 
Show that in the lowest energy state 


(axy(ar)=5 
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in agreement with the certainty principle. Confirm that for the higher 
states 


(Axy(Ap) >. 


7 The function Yrim) (5.41) is an eigenfunction of Y, with eigenvalue 
am. The commutator of &, and Y, is 


LL, — Ly Ly =in L,;. (5.28) 
Using equation (5.55) show that 
h? |m| 


(AL, KALy) = =a 


8 Verify that the resuit of adding the two travelling waves 
yi =A cos (Kix — wyf) V2 =A cos (kx — wf) 


6k ) 
Y=y¥1 +2 =2A cos (3: - ‘2 ) cos (KoxX — wot) 


where 


6W = Wy, — Wo. 


This represents a wave with average frequency wo with a modulated 
amplitude. Verify that the modulation moves with a velocity 5w./5k. 
(Compare (5.63) and (5.64).) 


9 Verify that the wave packet 


W(x, #) = A 1 2) 


a [2? +a 
m 


is a solution of Schrédinger’s time-dependent equation in one-dimension 
for a free particle. Show that the expectation values of the position and 
momentum are both zero and derive the following expression for the 
variance 


wo (Q U8 
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This shows that a wave packet (in a dispersive medium where E(k)) 
spreads in time. You may assume the definite integrals, 


oo 


J xe dx=Vn/2 } e™ dx=vV/n. 


—oo —_oo 


10 Prove the vector identity 
div(p grad w* — W* grad W) = V2 y* — wt Vy 
where y is a scalar function of position. 


11 Let Wim; be an eigenfunction of the central field problem. The 
current density at a point in space is (from (5.79)) 
ehm, 


fad 2 
lo =r sin 6 |Wrtmyl”- 


The magnetic moment associated with a current carrying loop is equal to 
the current in the loop multiplied by the area of the loop. By first of all 
considering the magnetic moment in an annulus of small cross-section 
area about the z-axis and then summing over all such annuli show that the 
total magnetic moment is 


M=—m). 


2p. 


12 The Hamiltonian operator is, in spherical polar co-ordinates, 
n[1a/(.,4 1 2 a 
=—~—|>—(r* —]+5— = [sine 
ohne E ar ( 2 r sin @ 20 (in 2 


Perea eee ae + VO, Q ¢) 
r? sin? 8 3? ay? 
where u is the particle mass. 

Explain why, if the potential V is independent of @ and ¢ the energy 
eigenfunctions can also be chosen to be eigenfunctions of the total 
angular momentum operator #7. 

Show that the radial part R(r) of the variable separable eigenfunction 
of an energy state with angular momentum quantum number /, satisfies 


noid , dR nh? id+1) 
oe a —) +— ——— R+VRE=E. 
( 5) P R 'R ‘R 


where E is the state energy. 
Substitute R(r) = x(r)/r and obtain 


WP ax | 82? 


= 2 4 = 
Qu dr? 2 ur? ht ve] Ex. 
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Suppose 
V(r) = —Vo exp [—r/a}, Vo >0 


By substituting z = exp [—r/2a] show that the radial wave-functions for 
bound states with / = 0 are of the form 


A 
R= = Jy (ke"/*#) 


where A is a constant and 
Jy (kK) = 0. 


[J_ is the Bessel function of order v.] 
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CHAPTER 6 
Matrix Representations 


6.1 Introduction 


In the previous chapters the theory of quantum mechanics has included 
differential operators and differential equations. There is an equivalent 
alternative description in terms of matrices. Matrix mechanics was 
developed by Heisenberg, Born and Jordan and actually preceded the 
Schrédinger wave mechanics already described. In this chapter matrix 
mechanics will be deduced from the Schrédinger approach. 

An eigenvalue problem can be interpreted in terms of solving a set of 
linear equations. Suppose a is a Schrédinger differential operator 
representing an observable A and let y be an eigenfunction with eigen-) 
value a; 


ay =ay. (6.1) 


W can be expanded in terms of the complete set of orthonormal eigen- 
functions belonging to some other operator, say §;a discrete spectrum 
is assumed, i.e. 


V=2Zejd with — Bb; = BH. (6.2) 
i 
With this expansion for y, equation (6.1) becomes 
2 cGy =a 2.6) 9}. (6.3) 
j i 


If this equation is premultiplied by ¢; and integrated over all the space 
then 


> Cj Oyj = acy (6.4) 
j 
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where | 
ay = j or ag; dr. | 


Equation (6.4) is true for all / and represents a set of linear equations. 


QC, FQyoCog +... = acy 
41Cy $AQ2C2 +... =alg 


Q31C1 +039C2 +... =ac3 (6.5) 


As the operator 8 has an infinite number of eigenvalues then (6.5) includes 
an infinite number of equations. Sets of equations such as (6.5) can 
conveniently be represented in terms of matrices and it is useful at this 
point to make a brief digression to remind the reader of the essentials of 
matrix algebra. 


6.2 Matrix Algebra 


A matrix is a rectangular array which obeys certain rules of algebra, e.g. 


Ay 12 eee Qin 
A= 1 22... Agn (6.6) 
Qn, Am2.-- Amn 


aij is called an element of the matrix A, the subscript i denotes the row 
and the subscript j the column. The matrix (6.6) with m rows and n 
columns is called an ‘m by n’ matrix. If the number of rows is equal to 
the number of columns then the matrix is square. 

A matrix 


C1 
c2 


Cn 
composed of one column is called a column matrix or a column vector. 
Similarly a matrix with a one row 


(¢1, C2, oo oy Cn) 


is called a row matrix or row vector. 
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Two ‘m by n’ matrices are equal if and only if corresponding elements 
are separately equal. 

The two fundamental laws of matrix algrebra are addition and 
multiplication. 


Addition and subtraction 


The sum of two ‘m by n’ matrices T’'and A with respective elements yi 
and Aj is defined by the ‘m by n’ matrix A with elements 


and 
A=Tt+A 
e.g. 


01 1 3 
T=(2 1} a=fo 2 
0 3 1 4 


Addition is only defined for matrices that have the same number of rows 
and the same number of columns. 
Substraction is the reverse of addition and the matrix 


==T-A (6.8) 
with 

Oi = Vij — Nj 
is called the difference of T and A. 


From the definition (6.7) it is obvious that the addition of matrices is 
both commutative and associative, i.e. 


T+A=A+T (6.9) 
and 

(AtT)+A=A+(T+A). (6.10) 
Multiplication 


The product of the matrix I and the constant d is defined to be the 
matrix dT with elements 


(aD) jj = ayy. (6.11) 
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The product A = TA of two matrices is defined only when the number 
of columns of Tis equal to the number of rows of A. The elements of 
the product matrix are given by 


Oy = 27 My (6.12) 


e.g. 


0101 
A=TA=|2 5 6 1). (6.13) 
03 03 


Observe that the number of rows of A is equal to the number of rows of 
T and that the number of columns of A is equal to the number of 
columns of A. Matrix multiplication is not commutative, i.e. 


TA#AT in general. (6.14) 
For example 
1 1 1 2 
r= ( A= ( (6.15) 
0 1 1 1 


then 


2°.3 1 3 
ra-( and AT -( } 
1 1 1 2 


However matrix multiplications is associative, i.e. 


(AT)NA= ACA). (6.16) 
Also multiplication is distributative over addition. 
A(I'+ A) =AT'+ AA. (6.17) 


A matrix whose elements are all zero is called the zero or null matrix 
and is denoted by 0. 
An important result is that 


TA=0 
does not necessarily imply 
T=0 or A=0 
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| eg. 
1 1i\/-1 i) (0 0 
(/ 1 ( 1 ») -(; *} 
It is now clear how sets of equations can be represented by matrices. 


The equations 


O44Cy FOy2QCq +... FA inly =O 


OC, + %m2C2 +--+ Omnln = an 
can be written 


Ac=a 


Oy Qy2 ..-Ay, 
A= ; 
Amt %m2---Amn 


and c and aare the column matrices 


where 


Cr a 

C2 a4 
c= a= 

Cn an 


The transpose of a matrix 


Suppose I is an ‘m by n’ matrix with elements i. The transpose of this 
matrix is an ‘2 by m’ matrix I’ such that 


| Dy = 1% | (6.18) 


The columns of I become the rows of T and vice-versa, e.g. if 


1 2 
is 10 2 
T=[0 1 then r-( 
2 1 °=#0 
2 0 


Clearly the transpose of a transpose is the original matrix, i.e. 


(f) =P. (6.19) 
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The transpose of a product is the product of the transposes in 
reverse order. 


(TA); = (TA); 


7 Oy (A); 
= 2A 1, 
= (Ar ij 
i.e. 
(PA) = AR. (6.20) 


Square matrices 


A matrix that has the same number of rows and columns is called a 
square matrix. 

If a square matrix is equal to its transpose it is called a symmetric 
matrix, ie. if 


r-Fr (6.21) 
eg. 


A square matrix whose only non-zero elements lie in the principal 
diagonal is called a diagonal matrix, e.g. 


1 0 0 
02 0 
0 0 3 


All diagonal matrices of the same order commute and have diagonal 
products. 

A diagonal matrix whose elements in the principal diagonal are all 
unity is called a unit matrix and is denoted by I. The unit matrix of 
order three is 


1 0 0 
1=}0 1 0 
0 0 1 
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Trace 


The trace (or spur) of a square matrix is the sum of the diagonal elements; 


Trace T = > yi. (6.22) 
i 


The trace of a product of two matrices does not depend on the order 
of the factors. 


Trace (TA) = 2 (Sry rm) = 2 (5 evi) 
i.e. 


Trace (FA) = Trace (AT). (6.23) 


The inverse of a matrix 
The inverse of the square matrix T is the matrix I~ such that 
Tr? =Pr're. (6.24) 


This inverse exists if and only if the matrix I'is nonsingular; that is if the 
determinant of the elements of I’ is non-zero. In this case the inverse is 
obtained from F by replacing each element by its cofactor and then 
transposing the resulting matrix and finally dividing it by (det YP), e.g. if 


i) om ret(E (9) 


If Tis a diagonal matrix, the inverse is also a diagonal matrix whose 
non-zero elements are the reciprocals of those of T. 


Yu Oo... 0 VWru 0... 90 
0 Yo2- + 0 1/Y¥22 - 
T=/| . T l= i hs 
ee Oitiagca ee, UR 
(6.25) 


The inverse of the inverse of a matrix is the original matrix. To prove 
this consider 


rr =i. 
Put P= A™™ then 
ANAT)? =L 
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Premultiply this equation by A and then 
(A')'=A (6.26) 


as required. 
The inverse of a product is the product of the inverses in reverse order. 
If (PA) is the inverse of (TA) 


(TA)(TA)™ =1. 
Premultiply this equation by I. 
ATA)? =r". 
Now multiply by A™ and 
(TA)? =a7r (6.27) 


as required. 


Complex matrices 


The matrices that occur in quantum mechanics generally have complex 
elements. The complex conjugate of the matrix T'is denoted by I and 
has elements such that 


(*)j = Qj. (6.28) 


A square matrix I whose transpose is equal to the complex conjugate 
is called Hermitian or self-adjoint, i.e. 


r=r*. (6.29) 


Clearly the elements in the principal diagonal of a Hermitian matrix 
must be real. If all the elements are real, the matrix must be symmetric. 

A square matrix I whose tranpose is equal to the complex conjugate 
of the inverse is called a unitary matrix, i.e. 


r= re). (6.30) 


A real unitary matrix is called an orthogonal matrix. 


Eigenvalues 


Let A be a square matrix with 2 rows and c a column vector withn 
elements. In general the product Ac will produce another column vector. 
However in some special cases this resultant column vector is simply a 
multiple of c. 


Ac = ac. (6.31) 


This is a matrix eigenvalue equation. The null vector c = 0 is a solution 
for all finite 2. A value for a for which c ¥ 0 is called an eigenvalue of the 
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matrix A and c is the corresponding eigenvector. Equation (6.31) 
lrepresents the n equations 


OLyyCy FO yQCo +... FO yyy = Cy 
Oni Cy tOnge2 +... + Anny = aly (6.32) 
i.e. 


(04, — @)cy FOy2C2 +... + Mi nCy =O 


Oni1 +On202 +... + (Onn — 2)en = 9. (6.33) 


This system of n homogeneous linear equations has a non-trivial solution 
if and only if the characteristic determinant is zero. 


yy —~A2 = AyQ~.. Ain 
det(A—al=| . =0. (6.34) 


On1 An2 Ann — a 


This determinant can be expanded to give a polynomial of nth degree in 
a: the characteristic polynomial of A. The n roots of this polynomial are 
the eigenvalues and may be real or complex. For every eigenvalue, the 
equations (6.32) have a solution giving the corresponding eigenvectors. 
However it should be observed that each eigenvector can be multiplied by 
an arbitrary constant and still remain an eigenvector. Consequently it is 
always possible to define normalized eigenvectors that satisfy 

c*c = 1. (6.35) 


As an illustration consider the two by two matrix 
a() o} 
The eigenvectors of A are the solutions of the simultaneous equations 
—ac, tc, =0 
C4 — acy = 0. 


These equations have a non-trivial solution only when 


—a 1 


1 —-a =q*—1=0 


| 

| 
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The corresponding normalized eigenvectors are | 


“aall) val} 


Theorem 1 


The eigenvalues of a Hermitian matrix are real. 


Proof 


Suppose A is a Hermitian matrix and a is an eigenvalue with non-zero 
eigenvector c. 


Ac=ac (i). : 
Premultiply this equation by ¢* 


c*Ac=ae*e (ii). 


From (i) 
A*c* =a¥*c*, 
Premultiply by ¢ 


CA*c* = a *Ec*. 
As A is Hermitian 
@Ac* =a*éc*. 
Transpose (using (6.20)). 
c*Ac a*é*c 
and consequently by comparison with (ii). 
a=a* (6.36) 


and the eigenvalue is real. 


Theorem 2 


The eigenvectors of a Hermitian matrix belonging to different eigenvalues 
are orthogonal. 


Proof 
Let A be a Hermitian matrix with eigenvectors cy, ¢,, such that 


Acy =anCy (i) ACm =@mCm (ii). 


MATRIX REPRESENTATIONS 135 


i Premultiplying (i) by om 
a Achy =anemCn- 
Also since A is Hermitian 

Bh, Mey = Oh Aen = (A* On En 

=AmemCn 

aS dy, is real. 

@n - am)&m Cy, = 0 
and as a, Fam 

Gen =O nem. (6.37) 


That is the eigenvectors C,,, Cy are orthogonal. 


6.3. Matrix Mechanics 


A quantum mechanical eigenvalue problem can be stated in differential 
equation form (Schrédinger) or in matrix form. If a is a Schrddinger 
differential operator representing an observable A, the eigenvalues @ are 
given by 


any; = iW. (6.1) 


It has been shown in section 6.1 that this can be represented as a set of 
simultaneous equations (6.5). Writing 


y= dX oj with BO; = b; (6.2) 
j 
then in matrix notation the equations (6.5) are 
Ac=ac (6.38) 
where 
144 %qQ--- 
021% 22 ‘ 
A= a with 00; = { 1 a; drt, (6.39) 
and 
C1 
C2 


c=]. : (6.40) 
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The orthonormal set {¢j} is called the basis of the representation and 
the column vector c is the representative of the wave-function y. 

It is assumed that the operator spectrum is discrete and the matrices | 
(6.38) have a denumerably infinite number of rows and/or columns. All 
the appropriate results given in section 6.2 for finite matrices will be 
taken over in the infinite case. The more general case when the spectrum 
is partly or wholly continuous is outside the scope of this book. The 
reader is referred to Mathematical foundations of Quantum Mechanics, 
by John von Neumann. 


Hermitian matrices in quantum mechanics 


As the operator @ represents an observable it must be Hermitian 
(postulate 2). Clearly 


Oj; = oy 
and the matrix A is Hermitian (6.29). It has already been shown in 


theorems one and two that such matrices possess real eigenvalues and 
that their eigenvectors belonging to different eigenvalues are orthogonal. 


The diagonal representation 


The matrix representing an observable can always be put into diagonal 
form. If the orthonormal basis used to expand the eigenfunctions of (6.1) 
is taken to be these eigenfunctions themselves, then the diagonal represen- 
tation is obtained. 


ay = [vitow, dr = 451. 


The matrix eigenvalue equation (6.38) becomes 


ay 0 Ons. Cin Cin 
0 a2 O. ses Con Con 


0 0 a3... C3n =an C3n |. (6.41) 


1 0 0 
0 1 0 
c= 0 >» 25 0 > Cy = (6.42) 


"—O-or:: 
33 
+ 
_ 
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| The diagonal matrix representing the Hamiltonian operator for a 
one-dimensional harmonic oscillator is 


400 
H=hw{0 # 0 
0 0 3 


and the eigenfunction with energy (# + 4)ftw is represented by the 
column vector 


0 0 
0 1 

cn * 0 n-1 
n 

nt+1 


Transformation of basis functions 


An arbitrary wave function y can be expanded in terms of a complete 
orthonormal set of basis functions which span the Hilbert space of the 
system, 


v= Dey | oe ar= Sy. (6.43) 
] 


The ¢;’s are analogous to unit vectors and the cj’s are the ‘components of 
the vector’ c. 

Of course there are different sets of basis functions apart from {9}. 
Suppose the orthonormal set {x;} also spans the Hilbert space. 

Clearly the members of the new set can be expanded in terms of the 
old set, i.e. 


X= Lopupi i=1,2,... (6.44) 
p 


where upi are the expansion coefficients and form an infinite matrix u. 
Similarly 


$= DXqhq «F112, .-- (6.45) 


q 


where vp; form an infinite matrix v. Substitution of (6.45) into (6.44) 
gives 


Xi = DXq % Ygptps 
q P 


> UgpUpi = 8 gi- 
p 
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In matrix language | 
vu =I, (6.46) 
Similarly, substitution of (6.44) into (6.45) gives 
uv =I, (6.47) 
From the definition of a matrix inverse (6.24) 
v=ul. (6.48) 
As both basis sets are orthonormal, then from (6.44) 
[xtx, dr = 64 = fz bp Upi 2 Ge Ug; AT 
ie. 
by = 2 upitp;- 


An element of the transpose matrix 0 will be denoted by Z;. As 


* =e 
Uni “ip 


then 
by = Litip Upj- (6.49) 


In matrix rotation 
l= t*u. (6.50) 


It has been shown that u has an inverse and if (6.50) is post-multiplied 
by u! 


ul =i*. (6.51) 


This final result shows that the matrix transforming one orthonormal 
set into another is unitary (see (6.30)). 

A change of basic functions is analogous to a rotation of co-ordinate 
axes in three-dimensional Cartesian space and as is well known such 
rotations are performed by orthogonal (real unitary) matrices. 


Transformation of a representative 


Obviously the representative of an arbitrary wave function y will alter 
under a change of basis functions. Suppose 


w= Dod (6.43) 
I 
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| in terms of the old set {¢} and 
w= fixi (6.52) 
t 


in terms of the new set {x;}. The members of the new set can be 
expanded in terms of the old set and substituting (6.44) into (6.52) 


y =¥ fi 2dpups= %(Zupifi) Gp. (6.53) 
U Pp p\t 


Comparison of (6.43) and (6.53) gives 
of = Luyf JHA, Zeeks 
i 


i.e. 
c=uf. (6.54) 


The old representative c is equal to the transformation matrix times the 
new representative f. 


Transformation of an operator matrix 


The infinite square matrix representing an operator @ must also alter 
under a change of basis. A(!) will be used to represent the matrix with 
respect to the set {4} and A(2) for the matrix in the {xj} representation. 
From (6.4) 


af z fo ag; dr, af?) = | XFOX; dr. (6.55) 
Using (6.44) 
of? =| 2 bpp ia 2 $attai dr 


=D upiaajahe = 2 Dip oe Uaj 
Pq Pq 


A = GFA. 
As the transformation matrix is unitary 
A® =yAMy, (6.56) 


This is a similarity transformation. 


Theorem 3 


A Hermitian matrix remains Hermitian under a unitary transformation. 
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Proof | 


Suppose the Hermitian matrix A@) is transformed into A@) by the 
unitary transformation u. 


AQ =u tay, 

Transposing 
A@ =qAMOT, 

As the matrix u is unitary and A“) is Hermitian 
A® 7 (ut AOy)* = a*(2) 


and so A() is also Hermitian. 


Theorem 4 


The eigenvalues of a Hermitian matrix remain invariant under a similarity 
transformation. 


Proof 


Suppose the operator a is represented by the matrix A“) in the original 
representation. The eigenvalue a satisfies 


AMc=ae (6.57) 


where c is the representative of the eigenfunction. If the basis functions 
are transformed using (6.44) the operator matrix undergoes a similarity 
transformation (6.56), so that 


A) = pAQyt (6.58) 
and the eigenvalue equation (6.57) may be written 

wAuc = ae. 
On premultiplying by u~! 

A@(u7c) = a(u7'c). 
From (6.54) this becomes 

AOE = af (6.59) 


where f is the eigenfunction representative in the new basis and the 
theorem is proved. 


Theorem 5 


The matrix of the product of two operators is the product of the 
separate matrices representing the operators. 
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Proof 


Suppose y and A are two quantum differential operators and that {9;} is 
a complete set of orthonormal wave functions. Clearly 


\ 


Ady = DAjid;. (6.60) 
Aji is an Serica of the matrix A representing A in the basis {¢,;} and 
hue i} OF Ad; dr. (6.61) 
Then 
YAO; = Dy; 
= 2Ay Myo 
j 1 
= 2 2 WHA (6.62) 
where 


Vy = } o; 70, dr 


and is a matrix element of T. 
Premultiply (6.62) by $3, and integrate over all the space. Then 


(mi = j Om AO; AT = 2 mpd: (6.63) 


and the theorem is proved. 

The matrices satisfy the same commutation relationships as the 
differential operators. [n particular the matrices representing the co- 
ordinates q; and the conjugate momentum pj satisfy 


h 
Pde — de BY = 154. (6.64) 


As diagonal matrices (of the same order) commute, then (6.64) 
implies that there is no representation in which pj and q; are simul- 
taneously of diagonal form. This of course is an expression of the 
uncertainty principle in that the operators p; and qj do not possess a 
simultaneous set of eigenfunctions. 


Angular momentum matrices 


Simultaneous eigenfunctions can be found for the angular momentum 
operators #? and Y, with eigenvalues /(/ + 1)#? and hm, respectively. 
These simultaneous eigenfunctions y (J, m;) form a complete set and an 
arbitrary wave-function can be expanded in terms of them. 


V(r) = 22 el, mv m). (6.65) 
mi 
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One choice for the basis functions is the orthonormal hydrogen type 
functions (5.41). 


WO, m1) =Rn PF" (cos 6) em, <1. (6.66) 


The matrix elements for the angular momentum operators in this 
representation are 


(L? imp t'my 2710 + V8 q Symp (6.67) 
and 
(Lz )im), my =7m)817' d mymy- (6.68) 


The matrices are both diagonal 


(6.69) 


(6.70) 


The matrices representing L,, Ly are not diagonal. 


The Schrodinger picture 


In Chapter 3 the basic postulates of the Schrédinger theory were 

given. The differential operators representing observables are in general 

time independent and the time-dependence of a state is included in the 

wave-function which undergoes a rotation in Hilbert space as time passes. 
The Schrédinger equation has an analogy in matrix mechanics. Let 

{¢;} be a complete set of time-independent orthonormal basis functions 

which can be used to expand an arbitrary state wave-function. 


W(r, 2) = D o(2)O(2). (6.71) 
] 


MATRIX REPRESENTATIONS 143 
As W(t, f) must satisfy the Schrédinger equation 


2 Hey (Oj) =ih > “a (1) (@). 
j 


Premultiply this equation by ¢; () and integrate over all space. 
2 ty co (1) = ihe) 
where 
Hy =| P34; Ar. 
In terms of matrices 
He(t) = ihé(t). (6.72) 


The time-dependence of the wave-function W(r, t) is reflected in the 
time-dependence of the column vector. 

In the ‘Schrédinger’ picture where the expansion (6.71) is employed 
any time-independent differential operator is represented by a time- 
independent matrix. 

When the basis functions used in (6.71) are the energy eigenfunctions, 
the matrix H is diagonal and the time variation of the vector c is given by 


c(t) = eB i#/c,(0). 


The Heisenberg picture 


There is another important way of discussing the time variation of a 
state. This is the Heisenberg picture in which the basis functions used to 
expand an arbitrary function are time-dependent. Suppose ®,(r, f) and 
;(r, £) are solutions of the system Schrédinger equation corresponding 
to the initial states ®;(r, 0) and %;(r, 0) respectively. The Hermitian 
character of the Hamiltonian 9 determines that the ‘inner product’ of 
the two functions remains constant in time. 
d fix ao; 4 08; Z 

ae {z (r, t)®;(r, t)dr -{ ea G; +; af dr =0. 
In particular if the complete set {®;} is orthonormal at ¢ = 0 then it will 
remain so for all time if 

ag; 
HO; = ih ae for all j. (6.73) 
Since the set is complete it can be used to expand an arbitrary 

solution of the Schrédinger equation (6.73) 


Or, 1) = Dor, 0) (6.74) 
I 
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Since the set is orthonormal for all time 
o = [SG OG, D ar 
and from (6.73) the coefficients cj are constants. 
The column vector representing the wave-function is now time- 
independent and the time variation is included in the operator matrix. 
Suppose a is a Schrédinger differential operator representing the 
observable A. The elements of the operator matrix A are given by 
aq, =f OF(, Nod/(r, 1) dr. : | (6.75) 


From Chapter 5 (equation (5.47) and question 4) the elements of A 
are given by 


d ~ i 
ae ot =| x (r, f) [5 + = (Ha — 2)| ®;(r, t) dr (6.76) 


and in matrix notation 


+ dA i 
=—+—. _ . 
A=>~ +> (HA ~ AH) (6.77) 


where the elements of the matrix 0A/dt are 


0A * 0a 
—| =| &f0,)—@, 
(2 if J Fy ear 


If the differential operator is time-independent then Heisenberg’s 
equation (6.77) becomes 


A== (HA AB). (6.78) 


As an example consider the matrices q; and p; representing the ith 
components of position and momentum for a free particle. From (6.64) 


i=1,2,3. 
é t 
P; => (Hp; — p;H)=0 
since 


1 
H = — (pj + p3 + p3). 
am 1 p2 + p3) 
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A useful representation is obtained when the basis functions are the 
energy eigenfunctions so diagonalizing the Hamiltonian matrix. Then 
| the elements of the time derivative matrix A are (from (6.78)) 
| 


: i 
Og = h [E, = E;) On (6.79) 


where E,, and Ej are energy eigenvalues. Integration of (6.79) gives 
,¢ k= Ex -*), 
Oj; (}) = a; (0) exp Fy (6.80) 


In this representation, off-diagonal elements oscillate in time with 
frequencies Ww; = (E, — Ej)/n. 


The density matrix 


The quantum mechanical theory so far developed deals with ‘pure states’ 
for which the state function is completely known. Even in this case the 
value of an observable is not well defined unless the state function is an 
eigenfunction of the observable operator. It is often the case that the 
state-function is not known completely and then additional uncertainties 
must appear. There may be many wave-functions compatible with the 
incomplete information known about the system and the effects of these 
must be suitably averaged. This second statistical aspect, due to the fact 
that not all the information that can be known is known, is dealt with 
by statistical mechanics and occurs in both quantum and classical 
mechanics. 

Such a quantum-mechanical system which is not known as completely 
as is possible, is said to be in a ‘mixed state’. It is useful to introduce the 
concept of a representative ensemble. This is a collection of similar 
systems each described by different wave-functions but with the property 
that the values of the known observables are the same in all systems. The 
values of the remaining observables (of a complete set) form a suitable 
chosen distribution. 

Consider a representative ensemble composed of N systems described 
by the normalized wave-functions V,, V,,..., Ya. Let {Bj} bea 
complete orthonormal set which will be assumed to be discrete. Then 


v;= D ji @; with Ci = { OF; dr (6.81) 
i 


A ‘density operator’ p is defined by specifying its matrix elements in 
this {®;} representation. 


Chi Ck. (6.82) 
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It can be shown that the density matrix p is Hermitian and transforms 

like any other operator (6.56). 
In the special case that the wave-function is known completely (¥; | 

say), the density matrix takes the form | 


* 
Prj = CyiCri- 
The square of this matrix is given by 
2: * *,\e 
(e? Yeej = 2 (Cries) (Cji Cri) = Pr; 
since the wave-function is normalized and 


es ok 
Die =1. 


The density matrix is equal to its own square when the ensemble is 
described by a single wave-function. 

In the general case, the density matrix contains all the physical 
information that is known about the system and can be used to find 
the mean value of any observable. Suppose A is an observable represented 
by the Schrédinger operator a. The expectation value for A for the ith 
system in the ensemble is 


(A) = [Who dr => Leica (6.83) 
] 


where 
OK i | efow, dr. 
The elements ajx form the matrix representing A in the representation 


defined by the set {®;}. The mean value for this observable for the system 
represented by the ensemble is from (6.83) and (6.82) 


N 
1 
[A] “HV > [23ctew] = Trace (pa). (6.84) 
i=1 
This equation is valid in all representations. 
In classical mechanics the observable A is a function of the co-ordinates 
and momenta. The classical average for this observable for the ensemble is 


[4] = | f(2,4, DAC. a) dp da 


where the integration is over phase space and f(p, q, t) is a suitably 
chosen distribution function. The density matrix in quantum mechanics 
is analogous to the classical distribution function. 

In the Schrodinger picture the expansion coefficients are in general 
time-dependent. A simple expression can readily be obtained for 6. The 
expansion coefficients satisfy (6.72). 


> KH yj Cji = thex;.- 
i] 
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From this equation together with (6.82) and remembering that # is 
Hermitian it is easily shown that 


ih ber = & (Hej Oj — Pxj Hal 
i 


6=—= (Hp — pH). (6.85) 


Note carefully that this equation differs in sign from the equation of 
motion (6.78) of an observable in the Heisenberg picture. 

Equation (6.85) does not hold in the Heisenberg picture as the time 
dependence is then in the matrices representing the observable and in this 
case p is a constant matrix. 

For a more complete discussion of the density matrix the reader is 
referred to the review paper Theory and Application of the Density 
Matrix by D. Ter Haar. 


PROBLEMS 


1 Explain how any real square matrix can be written as the sum of a 
symmetric matrix and a skew-symmetric matrix. 


2 Let Abea square matrix with rows and let c; be an eigenvector 
with eigenvalue d, so that 
(A-— d; De; =0. 


An arbitrary column vector x with m rows can be expressed as a sum of 
the n eigenvectors, i.e. 
n 
x= 2 jG. 
i= 
Show that A satisfies the Hamilton-Cayley equation 
(A—d,I(A-—d2,D...(A—d,D=09. 


(Note that is the characteristic polynomial of (6.34).) 


3 Prove that the eigenvalues of a skew-symmetry matrix are purely 
imaginary or zero. 


4 Prove that eigenvalues of a unitary matrix have the absolute value 
one and so lie on the unit circle in the complex plane. 


5 Prove that if the matrix T has eigenvalues d,,d,... then the 
matrix I? has eigenvalues d?, d2, .. . etc. 
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6 Consider the three matrices | 


01 0 -i 1 0 | 
= > = ee 
"1*\1 0 BOVE als POT Migs wd 
Show that 
(a) *? =23 =33 =] 
and 


(b) yt. —T2T; = 2it3. 


Show also that +,,¢2 and v3 have the common eigenvalues +1. (These 
are the Pauli spin matrices.) 
Construct the matrix 


e 
2 ad +73) 


and confirm that it has eigenvalues +e and zero. (This result is used in 
the isotopic spin formalism to distinguish between protons and neutrons.) 


7 Verify that the eigenfunction W,, = \/2 sin (nmx) are orthonormal 
over the range 0 <x <1 and satisfy the Schrédinger equation for a free 
particle in the one-dimensional box 0 <x <1. 

Construct the matrix representing the linear momentum 


ee 
ae i dx 
obtaining 
Pam = —i4anm|/(n? — m7) (n + m) odd 
=0 (n + m) even. 


Confirm that the matrix p is Hermitian. Use the relation 


1 
H=—p? 
2m” 


to obtain the elements #; and #2 of the diagonal matrix H and show 
that 


8h? > 4k? h? 1? 


a Ge -1P 2m 
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8 In the text it was argued that as the quantum mechanical differential 
operators are Hermitian then the representative matrices must also be 
'Hermitian. 
This final result may be obtained without reference to the Hermitian 
character of the operators. 
Clearly, any physical observable must have a representation in which 
it is represented by a diagonal matrix with real elements. Using the 
i result of theorem three show that an observable matrix is Hermitian in 
all representations. 


9 The adjoint of a matrix Tis denoted by T* and is the transpose of the 
complex conjugate, i.e. 

r*=f*. 
Show that a Hermitian matrix is equal to its adjoint (Self-adjoint) and 
that the inverse of a unitary matrix is equal to its adjoint. 


10 Suppose the Hamiltonian operator # for a system can be written 
HEH, +H’. 
Let the orthonormal set {W,,} satisfy the equation 
av, 
ot 


A wave-function W satisfying the Schrédinger equation for the total 
Hamiltonian # can be expanded as 


WY = Den(t)¥alr, 2). 


(Observe that as the functions V(r, f) and V,,(r, ¢) satisfy different 
Schrédinger equations the expansion coefficients are functions of time. 
Compare equation (6.74) for the Heisenberg picture.) 

Show that 


ny 2 
H Xen =i >, 5 en 


Ho, =ih 


and derive the matrix equation H’c = ic. (This representation is called 
the interaction representation. # represents two non-interacting 
systems and #” represents their mutual interaction. It reduces to the 
Heisenberg picture when H’ = 0.) 

Show that the equation of motion of a matrix T representing a time 
independent operator is 


T = ; (Hol - TH). 


Compare equation (6.78). 
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11 Using equation (6.23) show that the trace of a matrix is invariant | 
under a similarity transformation. Hence deduce that the trace is equal | 
to the sum of the eigenvalues. 


12 Consider the definition of the density matrix (6.82). Show that 
under a change of basis functions (6.52-6.54) the density matrix 
transforms like an operator (6.56), i.e. 


eo? =u pu. 
Hence show that 
Trace (pal?) = Trace (pPal). 


confirming that the mean value [A] (6.84) is unchanged by the 
transformation. 


13 It has been shown in the text that the density matrix of a pure state 
is idempotent, i.e. 


p” =p. 
This equation can be written 
e(p —1)=0. 
Explain why the eigenvalues of @ are zero and one. 
Show that 
Trace p = 1. 


Hence deduce that the eigenvalue one is non-degenerate. 


14 InChapter 4 it was shown that the eigenfunctions of the one- 
dimensional harmonic oscillator are the Hermite functions 


Vay) =e” 7H (y) (4.30) 


where H,,(y) is a Hermite polynomial. 
The Hermite functions are orthogonal and satisfy 


[Vadn dy = 2"nl/r. 


Given the recurrence relations 
2nWn-1 = Wnt Wn 
Qyvn = 2nYy-1 + Vat 
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| show that 


= 0 mé#nitl 
J YmyVn dy ={ "(tit ment) 
oP 2" alr m=n-1 
and 
oo 0 m#nil 
{ Ym dy=\—-2@ tye m=ntl 
i 2" n/n m=n—1. 


Multiply Y,(y) by a suitable constant to normalize the wave-functions 
W(x) (solution of (4.27))to unity and confirm the following matrix 
representations for the position co-ordinate and momentum. 


0oVY7i oO 0 O.. 

i 
Si oe 
072 0 V3 0 
0 -+/1 0 
0 


oo Oo 


Using the relation 


oi th atime? 
2m 2 


show that H is diagonal. 
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CHAPTER 7 


Approximate Methods 
of Solution 


7.1 Introduction 


Most problems in quantum mechanics do not possess exact analytic 
solutions. Some do of course, and a few of these have been mentioned 
in Chapter 4. However, approximate methods of solution play an import- 
ant part in the theory and the present chapter, together with Chapter 8, 
will be devoted to a discussion of them. This chapter will deal with 
stationary state problems where the aim is to evaluate the energies and 
eigenfunctions of some time-independent Hamiltonian operator. 

Three methods will be discussed, the variational method, the method 
of linear combinations including the perturbation method and the 
Wentzel-Kramers-Brillouin-Jeffreys (W.K.B.J.) approximation. The 
relationship between the variational and perturbation methods will be 
emphazised. (See also Slater.) 


7.2. The Variational Method 


Theorem 1. The Rayleigh-Ritz variational principle 


Suppose y is a wave-function that is not necessarily normalized. Then the 
quotient 


G)=| vet dr|f vey ar (7.1) 


is an upper bound to the least eigenvalue of the quantum system 
described by the time-independent Hamiltonian # 
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Proof 


As the Hamiltonian is time-independent it has a complete set of ortho- 
‘normal eigenfunctions {¢,,} such that 


Hon =Endn- (7.2) 


An arbitrary wave-function can be expanded in terms of this set. 


W= D cadn Cn constant. (7.3) 
n=] , 


Consider the integral 
{ vrsey dr =| Zetos % Sembm a7 
n m 
= DE, le,|*. 
n 

If E, is the ground state of the system then FE, <£,, and clearly 

[ vt90 dr > By Dien. (7.4) 

n 

But 
Deal? = [wry ar 
n 
and so 

| ysse ar 

[vrv ar 


as required. The equality applies when wp is the exact wave-function. This 
is a most important result. In practice an informed guess is made for the 
ground state wave-function which contains some parameters a;,@2,... 
say. The quotient (7.1) is evaluated and the parameters given the values 
that satisfy 


a) 


0a; 


SE, (7.5) 


0 i=1,2,... (7.6) 


so that the expectation value (#) is minimized. In general the energy 
eigenvalue obtained is considerably more accurate than the wave-function. 

The method can be extended to find upper bounds to the excited 
states. Suppose it is desired to find an approximation to the first excited 
level Ey. The choice for is then of the form 


vex-wi [uixar (7.7) 


where x contains the free parameters. In this case y is orthogonal to the 
normalized determined ground state wave-function y,. If py, is a good 
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approximation to the true ground state function ¢, then the expansion | 
for w (analogous to (7.3)) will exclude @), i.e. 


| 
y= p> CnOn- (7.8) 


It is left to the reader to verify that | 
(#) 2 E>. (7.9) 


Theorem 2. The virial theorem 


The virial theorem states that when a system is in a stationary state for 
which the potential is a homogeneous function of the co-ordinates of 
degree s the expectation value of the potential energy is 


=n (7.10) 


where (7) is the average kinetic energy. This theorem holds in both 
quantum and classical mechanics. 
Proof 


Consider a quantum mechanical system composed of n particles of 
masses m;. If they are at positions r; the Hamiltonian operator is 


n h? 
#2 —S > VP + Vy. tn) (7.11) 
isi 


where V} operates on the co-ordinates of the ith particle only and 
Viry, . - stn) is the potential energy of the system. Since V is homo- 
geneous 


Viar,,...,Q%,) =a°V(r,...,1n). (7.12) 
Let x(r1,.. -) fm) be an arbitrary function and define the quotients 


Re 
(x*Giicote) D5 Vix(t, ++ -s¥n) dT 
ot 


A a 
[xt eee In) Xf, - Bn) aT 
pace oe In) V(r1, - «5 In) X(t15+ tn) AT 
2B = 


fxr. $1699 tn)X(1, aie Hey Tn) dr 


the integrations being over all values of the co-ordinates. 
Consider now the trial wave function 


Wty, .- tn) = X(Or1, . - fn) (7.13) 
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where a is the variational parameter. The expectation values of the 
kinetic and potential energies are respectively 


{T)=a7A and (Vy=aSB. 


(To show this carry out the transformation ry =ar,i=1,...,n and 
note that the Jacobian is the same constant for all integrals and cancels.) 
The expectation value of the Hamiltonian is 


(HM) =07A +a SB 


and the best choice for a is that which satisfies 


KO 
3) 9 220A — sa-8— 1B 
0a 

or 

22 -s 

5 & A)=(a*B) (7.14) 
then 

(yen. 

s 


In the case that the particles have a charge e; respectively and interact 
through a Coulombic potential 
n 


CG 
Lee = —_—_— 7.15 
Viewsat)" >, >, Facoln oy (7.15) 
P=1 >i 
and s = —1. Then 
(Vy =—-2(T) =4+2#). 


This result must hold in the case that w is an exact eigenfunction of 
the Hamiltonian with energy £ and then 


=(T)=4(V). (7.16) 
Note that the kinetic energy has a maximum value in the lowest energy 
state. 
Ground state of the helium atom 
The Hamiltonian operator for the helium atom may be written 
2e? 2e? + e? 
Anéor, 4m€of2 4meéolry — 12! 
(7.17) 


where the two electrons are at r; and r, respectively. The kinetic energy 
of the nucleus and the spin-interaction terms are ignored. 


h? 
#2 —> (i + V3) 
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If the electron-electron interaction term e? /(4m€9|r, —r2|) is ignored, 
the Schrédinger equation is separable and the helium wave-function is 
simply a product of two ground-state hydrogen type wave-functions 
(see (4.73)), i.e. 


3 
Yum) =— (2) exp lay (r, | (7.18) 


In fact the two electrons will screen each other from the nucleus and 
so a reasonable simple trial function is to write 


3 
vonnd=2(2] exp se +)| (7.19) 


where a is a parameter. This wave-function is already normalized. 
The expectation value of the kinetic energy is clearly 


(T)=2(Ty) (7.20) 


where (7;,) is the expectation value of the hydrogen type atom (Z = 
in its ground state. From the virial theorem (7;,) is equal to the binding 
energy of the hydrogen type atom in its ground state and so from (4.71) 


aetm\ 1 
cry=a{ 2H lets 


a? e 


TTR (7.21) 


Similarly, the expectation value of the potential energy is 


(V)= (Wy) + ee (7.22) 
a # 4neglr, —12|/’ : 


where (V;,) is the potential energy of the hydrogen type atom (z = a) 
and from the virial theorem 


—a?2 e2 


aaeea) (7.23) 


(Vi) = 


The remaining term in the potential energy (7.22) represents the 
interaction of two superimposed spherical symmetric charge distributions 
and it can be shown that 


Cos ; ry —1 aaa) +i ae 29) 
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From equations (7.21)-(7.24) the expectation value of the 
Hamiltonian is 


2 
(= ( da + «| é (7.25) 


8 | 4mepag 


The best value for the parameter is obtained from 


2 
aH) _ e (20-7)=0 


da 4n€9 ao 8 


a= 3. (7.26) 


The least upper bound for the ground state energy that can be 
obtained from the trial function (7.19) is 


2 2 2 
—£__ »_7.843 — (7.27) 


Atego 4n€gdg 


The corresponding binding energy, that is, the energy required to remove 
both of the electrons, is +2°848 e?/4m€qao. The experimental value is 
+2904 e? /4tegag. 


73 The Method of Linear Combinations 


As has already been mentioned in the introduction, this chapter deals 
with stationary perturbation theory in which the Hamiltonian is time- 
independent. The variation principle may be used to derive the appro- 
priate expansions. 

The object is to find the eigensolutions of some given Hamiltonian 
SH. That is, to solve 


Hy =EV. (7.28) 
| Let {xj} be a set of n linearly-independent functions which for the 
moment will not be assumed to be orthogonal. Observe that this isa 


finite set of functions and so is not complete. Suppose this set is used to 
give an approximate expansion for an eigenfunction of #,, i.e. 


v = p> CjXj- (7.29) 
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The wave-function (7.29) may be used in a variation problem where the 
expansion coefficients are regarded as parameters. The expectation value 
for the Hamiltonian (7.1) is 


n n 
[3 ctetae 3 eran 


i=1 


n Par nm 
iP CrXi 2, C7Xj ET 
iz 


7=1 


eo (7.30) 


Siz = { xtx; dr. (7.31) 


(7.32) 


H and S are both Hermitian and c is a column matrix. A very simple 
argument shows that this expectation value is real. As a scalar quantity 
is equal to its own transpose 


¢Hc* 


O)= a 


However as H and S are both Hermitian then 


This final result shows that (#) is equal to its own complex conjugate 


and so is real. 
The best trial solution of the type (7.29) is that which satisfies 


OH) _ 


z-=0 1£1,2,...,n (7.33) 
dc; 
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n 
> Syey}=O 11,2... 50 
i (7.34) 


Writing E, for the real approximate value of the energy, thesen 
equations are in matrix form 


He — E4 Sc =0. (7.35) 


If (#) is differentiated with respect to c; instead of c;' an equivalent set 
of equations is obtained. 

The equation (7.35) can be obtained directly by substituting (7.29) 
in (7.28), integrating and replacing E by £4. The condition for the set 
of n equation (7.35) to have a non-trivial solution is that the characteristic 
determinant be zero, i.e. 


det (H — E,S)=0. (7.36) 


The determinant (7.36) is a polynomial of degree n in E., and the lowest 
root is the best approximation to the ground state of the system. 


Hydrogen-molecular ion H} 


This is the simplest molecular structure and may be approximated by a 
one-electron problem, the two nuclei being in fixed positions. The 
electron Hamiltonian is 


2 2 22 2 
we Po lee + e 
2m 4neg\r, 72] 4eoR 


where 7,72 are the distances of the electron from the protons which are 
a distance R apart. This problem can be solved exactly in elliptic co- 
ordinates and the energy found for different values of R. The minimum 
energy occurs when R = 1-06 A in agreement with experiment. The 
corresponding binding energy is 2°8 eV. 


Figure 7.1 The hydrogen molecular ion. 
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However, as an illustration, this problem is solved by expressing the 
molecular orbital as a linear combination of atomic orbitals. The simplest 
approximation to the ground state is 


W Heyyy terV2 
where 1, 2 are the real normalized hydrogen (1s) wave functions 


centred on the two protons and ¢,,c2 are constants. 
The best solution of this type is obtained by solving the characteristic 


determinant 


Hy, - Es Hz —- EsS1 
Hy ~ E489 Hr. -—E a 


ul 
(=) 


where 


Sy =Sa =| Vivo dr 


as , and W, are real functions, and also 
Hi2 = Hn = (vib. dr 
KH, =H = [W290 dr. 

The two roots of the determinant are 


-#un? Hn 
j] ea Sy ‘ 
As W, and W2 are both positive in the region of overlap then it can 
be shown that 3,2 < 0 and so the lowest root is 


E Ay +Hy 
"Al eee 


E, 


This corresponds to a binding energy of ~1-8 eV which involves an error 


of 1 eV. 
The values of c, and c2 are obtained by solving the two equations 


(HA — Eades + (42 — EgSy2)c2 =9 
(Hy — E4Sa)e1 + (#22 - Ea)e2 =0. 

By putting £4 = £4, it can readily be seen that 
€) =C2 


and so the normalized ground state is 


Wg = (1 + Y2/V201 + S42)- 
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This is an even function, (gerade or g-state) and represents a bonding 
state in which charge is transferred to the bond, i.e. 
1 
Ye ren ers ne 2 +2 + 2 : 
lel 21 + Sy) (Wi + 2WiH2 +2) 


The other root of the determinant corresponds to the odd function 
(ungerade or w-state), i-e. 


Wu = (Wi — W2)/V/20 = S12) 


and represents an antibonding state in which charge is removed from the 
bond. 


Theorem 3 
Let £, be the rth root of (7.36) and cyj,j =1,...,7, the corresponding 
set of expansion coefficients. The n approximate eigenfunctions are 


n 
v= 2 CyX; r= 1,2,...5. (7.37) 
J= 


These approximate eigenfunctions are orthogonal. 


Proof 


Let Eq and E, be two different approximate eigenvalues with corresponding 
eigenfunctions Wq, W, 


n n 
fvav, dr= 2 Po CaiCrj [xix dr = pes aii Cri 
i=1 j=l ij 
= & Sc,. (7.38) 
From (7.35) 
He, =E,Scg (a) He, = £,Sc, (b). 
Transpose (a) and take the complete conjugate 
xk ~k Oe 
aH =E,e,S. 
As the matrices H and S are Hermitian 
éjH = E4&S. 
Post multiply this equation by c, and premultiply (b) by a 
Hc, =EgeSe, tHe, = E,&7 Sc, (7.39) 
On subtraction these equations give 
(E, — E,)€qSc, = 0. 
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If the eigenvalues are not degenerate, this equation implies | 
&7Sc, = 0. 

From (7.38), the approximate eigenfunctions are orthogonal. If these 

functions are normalized then 


| ve, dr = Sqr. (7.40) 


Discussion 


The linearly-independent functions fx}, 7=1,...,2 define an n- 
dimensional sub-space of the system function space. In the Rayleigh-Ritz 
variational method the best approximation to the ground state is 
obtained by minimizing the quotient (7.1). The best approximation to 
the rth excited state is obtained by minimizing (7.1) subject to the 
condition that the trial function is orthogonal to the (r — 1) approximate 
eigenfunctions for the lower eigenvalues. The results of the Rayleigh-Ritz 
method are upper bounds to the true eigenvalues. 


Calculated levels 


Energy 
n=3———- n=4 


True ground state 


Figure 7.2 


The method of linear combination is essentially the Rayleigh-Ritz 
method with the trial functions confined to the n-dimensional sub-space 
{xj}. The 1 roots of the characteristic determinant 


D,€,) = det (H — £48) =0 (7.41) 


give the corresponding best possible approximations to the lowest eigen- 

values. Of course the approximation is best for the deepest levels. 
Consider now the determinant D,+,(£4) constructed from the 

(1 + 1) independent functions xj, . . ., Xn» Xn+1- This problem is 

equivalent to Rayleigh-Ritz minimization method with the trial 

function wp restricted to the (mn + 1)-dimensional sub-space spanned by 

ix, f=l,...nt+l,ie. 


n+l 


y= 2 Cj X;- (7.42) 
r= 
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The greater flexibility of the form of y implies that the roots of 
Dny+1(Ea) will be less than or equal to the corresponding roots of 
D,(E4). If the rth roots are written E{"*) and E(”), the superscript 
denoting the dimensionality of the associated function space, then 


EO) < EM), (7.43) 


The roots of the new determinant D,,4,(E.4) give a better approximation 
to the previous n eigenvalues as well as introducing an approximation to 
another eigenvalue. 


Perturbation of non-degenerate levels 


The method of linear combinations can be used to obtain the well-known 
perturbation expansions. Consider the case where the Hamiltonian can 
be written 


H=H? +h (7.44) 


where the energy associated with #° is large compared with that 
associated with 4. 9° must be chosen so that its eigenvalues and eigen- 
functions are known. Let the functions {x,} be these eigenfunctions 
0, = 70. 
KH xi = E 7 X;- 
It is convenient to consider the Hamiltonian 
H=RM° +k = O0SAK<1. (7.45) 


The parametric value \ = 1 corresponds to ‘complete’ perturbation and 

d =0 to zero perturbation. The Hamiltonian (7.45) is solved and then 

put equal to unity in the results to obtain the required solution of (7.44). 
The unperturbed Hamiltonian eigenfunctions {x;} are orthogonal and 

may be chosen to be normalized. Following (7 .29), the first n of these 

eigenfunctions, corresponding to the 1 lowest energies, are used as an 

expansion set. The overlap integrals (7.31) are 


Sij = 84 
and the elements of the Hamiltonian matrix are 
_ 70 
Kj =E; by + Ahi 
| where 


hi =| xtAx; dr. 


The nv equation (7.35) become 


n 


> (EP dy + Ahy — Eady)oj = 0 1=1,2,...,m. (7.46) 
1 


j= 
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In the unperturbed case A = 0, the determinant of (7.46) is diagonal | 
and 


n 
TI GP - £4) =0 (7.47) 
j=l 
giving the 7 solutions 
E, =E%,E8,...Ep 


as expected. 

The basic assumption of perturbation theory is that it is possible to 
expand the perturbed energies and eigenfunctions as power series in r. 
There are some cases when this assumption is invalid. For example if 
#° has a discrete spectrum and # has a continuous spectrum. For the 
ground state, the unperturbed eigenvalue and eigenfunction are E ? and 
X1, respectively. The perturbed ground state energy is written 


Ey, = Ef +rEf? +07Ef? +... (7.48) 
and the perturbed ground state wave function 
Wr =X1 TAME) HA? YIP +... (7.49) 


E (1) is of the same order as & and E (2) is of the second order of smallness 
(The reader is warned not to confuse £ () etc. with the entirely different 
meaning given to E{”) in (7.43).) 

With the energy expansion (7.48), the characteristic determinant of 
the equations (7.46) becomes 


AAy, — AEP — 7B, Ay, .-- Min 
Aha, Ahgg + ES — E29 —-AE{Y — n7£f),... 

Ahn, 
(7.50) 


This determinant is a polynomial in ) of degree n and is identically 
zero for 0 <A <1. The coefficient of each power of A must vanish. 
The term in A, corresponding to the first order in smallness, is the 
diagonal determinant 


AG — Ef), 0, 0,...,0 
0, E? — Ef, 
E3 -£?, =0 


1) E2—-E? 
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&. 
I 


(hy, — Ef )(E9 — Ef)... 2 - £2) =0. (7.51) 


If the ground state eigenvalue FE? of the unperturbed Hamiltonian is 
non-degenerate 


ED = ay =f xtAxr dr. (7.52) 
This is the first-order perturbation of the ground state energy. By taking 
Eja = EP + rE) +B +... (7.53) 


it can similarly be shown that the first order perturbation of the jth 
state, (assumed non-degenerate), is 


sa a * 
EL) = hy = xi AX; At 
and so 
Eya = Ep +2 | xfhx; dr. (7.54) 


For complete perturbation A = 1 and so to first order, the energy of the 
perturbed jth state is 


Ejq = Ep + [xpAxy ar. (7.55) 


(Strictly Ej is a power series in \ and ought to be investigated for 
convergence to decide if it is valid to put \ = 1.) The energy (7.55) is 
the expectation value of the Hamiltonian (7.44) for the unperturbed 
wave function xj. 
The first-order change in the ground state wave function is expanded 
in terms of the first m functions of the set {xj} so that 
n 


y= > Mx: 


i=1 
and 


n 
Wr =C1 ref) xy FAD of; 
i=2 


When £4 is given its value (7.54), the m equations (7.46) are, keeping 
first-order terms only, 


Ay +(EP—EP)cf=0 1=2,3,...,0 


h 
dQ). "A =2 3 
Cc; Fo — Ep 1=2,3,...,”%. 
To first order in X, the first of the equations (7.46) is the trivial identity 
zero equals zero and so c{) is not obtained by this method. The value of 
c () is chosen so that y, is normalized and by considering first-order 
terms it is readily shown that the choice c{!) = 0 is satisfactory. 
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So, the coefficients in the expansion (7.29), for the ground state wav 
function, are to first order 


h 
a= Ms. gs ee : 
1 Po oc te (7.56) 


By an exactly similar argument the coefficients for the jth state wave 
function are, to first order 


7 _ Ay : 


and for complete perturbation with \ = 1, the jth perturbed wave 
function is 


n 
hy 
Vv; =Xj + 2 EP — EP XI (7.58) 
J] 


The second-order changes due to the perturbation can also be calcu- 
lated. To find the second-order change in the energy of the ground state 
the determinant (7.50) is expanded by the first row keeping only those 
terms in A?. Then 

{EP (ES — EES -— EP)... En - E?) 


—lhyI?(E3 — £2)... (2 — £2) — |ty3? Eo — Eo)(Ea — EP) 


... 9 — £9)...-IhinP? ES — £9)... G9, - EP} =0. 
This is equivalent to the bordered diagonal determinant 
-VEP) Ohhy Mia wee, NR 
Aha, ES-E? 0 res ) 
hay 0 yop dee OA =0. 
hat 0 Ee-E? 


On the assumption that the ground state energy is non-degenerate, 
the second-order change in the energy of the ground state is 
lay? 
0. 70° 
jee es 
This result may be obtained directly by substituting (7.56) into the 
first of the equations (7.46) and picking the second-order terms. Observe 


that the second-order change in the energy of the ground state is always 
negative as EP > EP. 
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By a similar argument, the second-order change of the energy of the 
jth state, (assumed non-degenerate), is 


n 2 
2) = S | hyal 
Ef J=a_ roamed LP 


0 0° 
l#j Ey —£; 
So to second order, the perturbed energy of the jth state with A = 1 is 


2 | Ay,l? 
izj 7 


Note that if the wave function is known to first order (7.58), enough 
information is known to calculate the energy to second order. 

If the number of basis functions is increased the calculated value will 
be improved. In the case that the complete set of eigenfunctions of #° 
is used, the summations (7.58) and (7.59) include an infinite number of 
terms. These are obtained directly in the more usual method of obtaining 
the perturbation expansions but this does not show the relationship 
between the variation and perturbation methods. 

The perturbation method is applicable when although the true 
Schrédinger equation cannot be solved, solutions can be found for a 
derived equation, which differs from the true one by terms that are 
small. Often this cannot be done or perhaps the convergence is poor. 
Then the variation method is often useful. 


Van der Waals interaction 


The attractive forces that hold most atoms together to form molecules 
are very short range and can be ignored for atomic separations greater 
than a few angstrém units. Even so there are weak attractive forces that 
act between neutral atoms at separations large compared with the Bohr 
radius. These are the van der Waals forces and they vary as 1/R® where 
R is the atomic separation. They are due to a mutual polarization of the 
two atoms and help to determine the properties of gases. 

Consider two hydrogen atoms at a fixed sitance R apart. This is a 
two-electron problem and the Hamiltonian is 


hh? 1 2 2 2 2 
H=— (Vi + ae pe ge. 


e? e? 


+ 
4n€ori2 4regR 


where rjq,7ip are the distances of the first electron from the first and 
second protons at ‘a’ and ‘b’ and similarly r2,72, are the distances of 
the second electron from the protons. The electron separation is rj 
(see Figs. 7.1 and 11.2). 
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The Hamiltonian 


2 2 2 
silat AURA eo Eo +2) 


2m Anég\ rig Tap 


represents the separated hydrogen atoms and has known exact solutions. | 
The ‘atom-atom’ interaction Hamiltonian 


Pa: ck ek ae oe ee | 
A= — +— J + — ] — + — 
4nég\r1p Toa} 4t€9 \r1. R 
can be regarded as a perturbation. 


For the ground state both electrons are in the (1s) state (about 
different protons) and the wave function is 


1/1\° 1 
exp vgomd=2(2] exp |- dt. +r) 
0 


It is convenient to choose a co-ordinate system with the z-axis along the 
line of atoms and an expansion of & in powers of 1/R, keeping the first 
term only, gives 
e 
h ~ GneoR® (X1a%25 + ViaV20 — 2219725) 


where (X1¢,¥ 1a, 21a) and (x25, Y2p, Z2p) are the co-ordinates of each 
electron relative to their respective protons. This can be written 


Fone 3(P1 - R)(p2 - R) 
saa (P *P2— R? 


where 
Pi = —eXygi +Yiaj + Zak) 


and similarly for pp. The perturbation represents the mutual interaction 
energy of the two atomic dipoles at any instant. 

Since & is an odd function the first-order perturbation integral 
vanishes. The second-order energy change of the ground state is 


|hy|? 


BSS Sooo 
io ~#y 


where hy; is the matrix element between the ground state and the /th 
excited state. Since EP >£? the energy change is negative and represents 
an attraction between the atoms. The form of the perturbation indicates 
that E{a — 1/R®. 
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Eisenchitz and London (Z. f, Phys. 60, 491, 1930) carried out this 
calculation and obtained the result 


EP oa 6°47 eas : 
(4ne€9)*R® 


The second-order Stark effect 


Consider the effect of an electric field &, acting along the z-axis, on the 
ground state of the hydrogen atom. The unperturbed Hamiltonian is 


Po eo 
2p 4n€or 


where jt is the reduced electron mass. The perturbation & is the extra 
energy of the system due to the electric field and is equal to —p.& 
where p is the dipole moment of the atom, i.e. 


4=e &z. 


(The electron change is — e). 
The unperturbed ground state wave-function is (4.73), 


1 \12 : 
=t-— ev o 
W100 (5 


As this function is spherically symmetric then clearly the first-order 
perturbation to the ground state energy is zero. 

However there is a second-order change in the energy. From (7.59) 
this is 


2 | Zz 7 
2)= 5 2 lq 
EP) =-e6? 5 a BS (7.60) 
q=2 

where Z1q = J Wio0ZWq d7 and Wq is an unperturbed excited hydrogen 
wave-function. From the form of the hydrogen atom wave-functions 
(4.72), it is clear that 24g is non-zero only for those excited states for 
which the quantum number m = 0. By further symmetry arguments it 
can also be shown that zg is non-zero only when the excited state / 
quantum number is one. 

The change in the ground state energy is proportional to the square 
of the field strength and a polarizability a may be defined so that this 
change is —a 82/2. From (7.60) 


iZ1ql" 


a =2e = ET) ie 
Ey - EL 


q= 


(7.61) 
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The dominant term in (7.61) is clearly that obtained from the (2p,) 
state when n = 2. A fairly short calculation then shows that 


a > 2-9(a3 4ne€9). 


The experimental value is a = 4-5 (a34n€9). 


Perturbation of a degenerate level 


The perturbation theory developed up to this point has been concerned 
only with non-degenerate levels. However, degenerate levels occur very 
frequently and this work must be extended. This is not in fact very 
difficult. 

It is clear that the results already obtained do not apply for a degener- 
ate level as there would then be singularities in both the first-order expan- 
sion coefficients (7.58), and in the second-order change in energy (7.59). 
Examination suggests that these difficulties could possibly be removed 
if the perturbation interactions hj; between degenerate levels, could be 
chosen to be zero. 

As a simple example consider the case of a two-fold degenerate level. 
In particular, in the previous treatment, (7.44-7.59), let EP = E29. The 
first- and second-order corrections to these degenerate levels are given by 
the solutions of (compare (7.50).) 


Ay — ESP — 7B, Ale wine Ain 
Moar, Aig — Ef? — NEP), 
M33 +E$ — EP - rE) — v2zP),...] =0. 
Miny 
(7.62) 


This determinant is again a polynomial in A. The lowest power is \? 
and this term is the determinant 


Ay, — AEP, Ay, 0,..., 0 
rAd, Ahg2 — AE$”, 0 
0 0, E$—&? =0 
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i.e. 
(hy — EP )(ha2 — EM) — |hy2? =0. (7.63) 


The two roots of this quadratic give the first-order changes in the 
degenerate levels. Generally these roots are distinct and the perturbation 
removes the degeneracy. 

The degenerate basis functions can always be chosen so that the sub- 
matrix referring to the degenerate levels is diagonal. That is, in the 
present case, it is possible to obtain from x, and x2, two new basis 
functions. 


t 
X1=€11X1 FCpX2 
X2 =C21X1 +C22X2 (7.64) 
so that the off-diagonal elements 
hig = [xitéxa dr=0, hy = [x3*Axh dr=0. 
In this case the two solutions of (7.63) are 
EM =n, = {xi*4xi dr and Ef)=ni, = [xitaxs dr. 


The first-order effects of the perturbation has split the originally 
doubly-degenerate level E?, into the separate levels 


E? +hiy and EF? +hyp. (7.65) 


The functions x4 and X2 are the zero-order wave-functions in that the 
perturbed wave-functions must approach them as A > 0. For the 
perturbed level EP +h}, the first-order wave-function is 


Wi =xi + NE + 2 §x,| teen (7.66) 


Keeping first-order terms only, equations (7.46) give (after 
diagonalizing the sub-matrix), 


hn +(EP -E®)e=0 1=3,4,...,n 
where 
his = | xtAx; dr. 


There are no first-order terms in the first two equations. The second-order 
terms in the second equation give 


(hoo — hiy)c$? + Py hy yo? = = where = hg; = Ixsthxy dr. 
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So the first-order coefficients for the wave-function with perturbed 
energy FE? +h}, are 


cy =1 cf) = S hag his 
= aera 7 re rey | eer | AE 
: + (aa — h(E? ~ F) 


1>2. (7.67) 


It is left to the reader to find the first-order wave-function correspond- 
ing to the level £2 +h}. 

This procedure can obviously be generalized to a degenerate level of 
any order. The problem is really to find the correct ‘zero-order’ 
(unperturbed) wave-functions that diagonalize the appropriate sub- 
matrix. With this choice of basis functions the first-order change in the 
energy is given by the same formula as in the non-degenerate case (7.52). 

The second-order changes in the energy can also be calculated. In the 
doubly-degenerate example discussed above the expansion of the 
determinant (7.62), keeping only those terms in \3 gives (with E M = =hy 
and hin =h3, = 0) 


MEP (hy. — E{P)(ES — EP)... (69 — £2) 
—3|h13)? too ~— E{Y)(E8 — £9)... (62 - £8)... 
Vl hinl? Geo. — EfP)ES — £9)... 8, - EP) =0 


ie. 
VE 0 h's Msn 
0 Aha. — EW) 0 
Ah, 0 €$ - £9) 26 
Mat 0 (Ef - Ef) (7.68) 


(This is of course the determinant (7.62) with the correct zero-order 
basis functions.) ; 
The second-order change in the level F? +h}, is 


E® =— . Nim? (7.69) 
Em —E 
m=3 


The second-order change in the level FE? +h}, is 


lhom|? 
) = IE 2m\ 
Ef > PE (7.70) 
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These results can be extended to a degenerate level of any order. 
When the correct set of basis functions is used the non-degenerate formula 
(7.59) applies except that the sum excludes all unperturbed levels 
degenerate with the level considered. In this way the singularities are 
avoided. 


The first-order Zeeman effect 


It is well known that degenerate atomic levels may be split when a 
magnetic field is applied. This is the Zeeman effect, and the theory for 
the hydrogen atom is outlined below. 

The classical Hamiltonian for a charged particle in an electromagnetic 
field was derived in Chapter 1 (1.45) and it was shown in Chapter 3 
(question two) that to obtain a Hermitian operator on quantization it 
should be written (for an electron of charge —e and mass 4) 


2 


Pa f et At+A. p)+ <4? ev (7.71) 
where the magnetic induction and electric vectors are given by 
B=curlA and é= —gad v— A. (7.72) 
The quantum mechanical Hamiltonian is 
x=_h ys S ai A. grad aie div A + sf _ eV. 
(7.73) 


For a hydrogen atom in a uniform induction field of magnitude B 
parallel to the z-axis the vector and scalar potentials may be chosen so 
that 


Ay: apes. ae vee - au 
et a ae 4 ~ Ane€ or’ Cy) 
As div A = 0 and 
1 a a 1_ 90 
.grad =~B{-y — +x—] =—B— 7.75 
See 32 ( ax =>) 2 d¢ aa) 
(where ¢ is the polar angle) the quantum Hamiltonian is 

n? e? e Ba 
#2 — > V? — m2 22 © pers yr, (7.76 
2p 4néor u os OG «By Gere. ) 


For the fields normally met in the laboratory the term in B? is much 
smaller than the electric field term —e? /4megr and can be ignored. The 
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unperturbed Hamiltonian is that of the free hydrogen atom and the 
perturbation term is linear in B. 


n? e* ihe B 2 
= — Pha . 
Qu A4n€pr’ pe 2 d¢ (7.77) 
The eigenfunctions of the unperturbed Hamiltonian are 
Rn (r)PI""(cos 0) e”". (4.72) 


The quantum number n determines the energy and there are degenerate 
eigenfunctions for 


1=0,1,..,.(@2-1),  -m=-l,—(-1),....0,...,41h 


The functions (4.72) are already eigenfunctions of 4 with eigenvalues 
ehBm,/2u and so the perturbation matrix is diagonal. When the magnetic 
field is applied the degenerate levels with the same quantum number / 
and 7 are split into the (2/ + 1) levels with energies 

ehB 
En +3 om =—1,..., 41 (7.78) 

These energy levels explain the spectrum of the normal Zeeman effect. 
However there are additional effects due to the spin magnetic moment of 
the electron and these will be discussed in Chapter 10. 


7.4 The W.K.BJ. Approximation 


This method carries the names of G. Wentzel, H. A. Kramers, L. Brillouin, 
and H. Jeffreys and it can be used to deduce the Bohr-Sommerfeld 
quantum conditions from quantum mechanics. The W.K.B.J. approxi- 
mation is applicable to ordinary differential equations. That is, one- 
dimensional eigenvalue problems or three-dimensional problems that can 
be reduced to one-dimensional form by separation of variables. It is 
useful when the potential is a slowly varying function of position in the 
sense that the fractional change of potential is small over several de 
Broglie wave-lengths. This will arise in the case of large quantum numbers 
when the wave-function has many nodes and the distance between 

them is small, corresponding to a short de Broglie wave-length. This 
method has been termed ‘quasi-classical’ in that it is most appropriate in 
the almost classical limit of large quantum numbers. Consider the one- 
dimensional, time-independent Schrédinger equation, 


(E — V(x) =0. (7.79) 
In Chapter 4 it was shown that the classical limiting form of the wave- 


function is essentially exp (S/n) (4.9) where S is Hamilton’s characteristic 
function. 
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Look now for solutions of (7.79) of the form 


W(x) = exp @4/A) (7.80) 


where A (x) is some function of position. For y to satisfy (7.79) A must 
satisfy the non-linear equation 


2 2 
(#4) ~ in 4 ~ 2m(E- V)=0. (7.81) 


Assume that it is now permissible to expand A (x) in series in h. 
nr 
A(x) = Ao(x) +hA,(x) + A2(x) Hoc (7.82) 
The classical limit corresponds to # > 0 and as expected A(x) is related 
to Hamilton’s function S(x). Quantum effects are included in the higher 


terms. If (7.82) is substituted into (7.81) and terms in and higher powers 
ignored then, remember 2? is small 


(*é2) =2m(E ~ V), (7.83a) 
i.e. 

Ag =+ | Qm(E~ Vy}? ax. (7.83b) 
This is the expected result since the classical momentum 

p(x) ={2m(E — v)}” 
and Hamilton’s function 

S(, t)=—Ett fr dx. 


A wave-number k(x) may be introduced with 


2m (E ~ V) = p?(x) =h?k? (7.84) 
and then 
Ao(x) = +h { k(x) dx. (7.85) 


If the first two terms of (7.82) are now substituted into (7.81) and 
the terms up to and including first order in # retained 


dAo\? dAo\{dAi\ . d7Ao 
—! —) _m —2 -2m(E - V)=0. 
(4 v2a(4 \(% dx? ma ) 
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From (7.83) this becomes 
do 
dA, _i dx? 
‘dx 2 ddo 
dx 


In terms of k, 
Ay(x) => Ink (@). (7.86) 


The approximate solution of Schrédinger’s equation (7.79) obtained 
by using the first two terms in the expansion (7.82) is 


W(x) = aA [K, exp(i fx dx) +K> exp (-i k dx)| (7.87) 


where K, and K2 are constants chosen to satisfy the boundary conditions. 
If E> V,k is real but in the classically forbidden regions where V > E, k 
is imaginary. 

The validity of the approximation requires (in (7.81)) 
d?A “4) ; 
—z| <li] | - 7.88 
dx? (is ( ) 


If this is evaluated using Ag, and writing A(x) = 22/k(x) 


av 
E-V/ 4n 


That is, the approximation will be good if the fractional change in 

(E — V) over the de Broglie wave-length A is small. Clearly the approxi- 
mation breaks down near those points where E = V(x). Classically these 
are the turning points of the motion where the kinetic energy and hence 
the momentum is instantaneously zero and the particle changes 
direction. 

At these turning points k = 0 corresponding to an infinite de Broglie 
wave-length. Then the approximate form (7.87) is singular although the 
exact wave-function will not be. In fact the expansion for A(x) (7.82) is 
not a convergent series but is asymptotically correct and can only be 
used several wave-lengths away from a turning point. 

Turning points generally occur in problems and a method must be 
developed to deal with them. Suppose there is a single turning point at 
x =a and 


E<V(@) x <a, 
E>V(x) x >a. 


<1. 


177 


Figure 7.3. 


In some region about ‘4a’ say,x; <x <x, the approximate wave-function 
(7.87) is not valid. 


Outside this region (7.87) may be used with k given as below. 
(a) For x >x, 


2 1/2 
p= V)'2>0 (7.89) 
(b) For x <x, 
k=iy 
and 
2 1/2 
y= Coe (V-E)'2 >0 (7.90) 


The physically acceptable solution must vanish as x > —°° and is 
1 a 
vx ~ a exo —fras| x<Xy (7.91) 
x 


Observe that the exponent decreases as x moves away from ‘a’. The 
problem is to find the corresponding solution of the form (7.87) valid 
for x > x. 

As V(x) is analytic it can be expanded in Taylor series and in the 
linear approximation it is represented by two terms 


VQ)=E-B(x—a)  B>O. (7.92) 


In the region x; <x <x where (7.87) does not apply, Schrédinger’s 
equation is solved exactly using the potential (7.92), i.e. 


d?y 2m 


we 12 BO - ay =0 Xp <x <x. 
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The independent variable may be changed to 


hi? 
and then 
d?y 
oo yy =0. (7.93) 


In terms of this variable the exponent in (7.91) can be written (from 
(7.90), (7.92)) 


. g 
frax=<y*? (7.94) 
3 
x 
and similarly (7.89), (7.92) 
r 2 
fax =F ly. (7.95) 
a 


Equation (7.93) is Airey’s differential equation. One of the two solutions 
is finite for all y and the asymptotic form of this for large | y | is 


1 2 
vy) ~ pia Boe (- 2 y=] y>o (x <a) 


(y)~D —Ta ris cos 3 yer 2) y<O0 3 =(x>a). 
(7.96) 


(The second solution is also oscillatory for y < 0 but increases 
exponentially for y > 0.) 

From (7.94), (7.95) and (7.96) it is clear that the solution of the 
form (7.87) that is valid for x > x2 and connects with (7.91) is 


x 
v(x)~ pz ra jrar4 x > x2. 
a 
This can be written 


1 f 2 r 1 
7a exp —[ vax uae [kax—7 : (7.97) 
x a 


The direction of the connection arrow is important. For further details 
see Kembles’s Quantum Mechanics. 
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Figure 7.4 


The Bohr-Sommerfeld conditions 


The problem here is to find the energy levels for a particle moving in a 
one-dimensional potential well. 

Consider the diagram. For any energy £, there are two classical 
turning points, say x =a and x =b. 

The asympotatic W.K.B.J. solution valid outside the region a<x<b 
must converge as |x| ° and so (from (7.87)) 


1 a 
Ve) ~ in exo [7a x<a 
x 


] x 
V(x) ~ Tira exe [-[ res] x>b. 
b 


The connection formula at x =a is (from (7.97)) 


1 ¢ 2 ; T 
ya exp B Y a > pa cos| | k dx — =| (7.98) 
Similarly the connection formula at x = b is 
2 i T 1 r 
jin | frax—4| < Tin oP [-re| (7.99) 
The |.h.s of (7.99) can be written 
2 ri 1 
qn cos| ft ax tal (7.100) 


where 


? T 
an | kax—>. 
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For the two connection formula to be consistent the r.h.s. of (7.98) 
must be the same function as the I-h.s. of (7.99). Since a sign change is 


immaterial then a must be zero or a positive integral multiple of 7, i.e. | 
| 


b 
frax—2 =nn, n=0,1,2,..% (7.101) 
a 
The integral over the period of the classical motion is 
b 
pk dx =2 | kdx=2m(n +4). 
a 


Writing p =7%ik this condition becomes 
$ pdx =(n +4)h, n=0,1,2,.... (7.102) 


The equation (7.101) (or (7.102)) determines the eigenvalues of the 
problem. The last equation is the Bohr-Sommerfeld quantization rule 
with half-integer quantum numbers. 


PROBLEMS 


1 Suppose p is some normalized approximate wave-function for the 
ground state of a system. Define the integrals 


I= [u*(-KyPydr and Ky = [vty ar, 
where # is the Hamiltonian operator. By expanding y in terms of the 
exact eigenfunctions of # show that 

I>, —K,)* 
where F£ is the true ground state energy, if K, is closer to £, than to the 


energy of any higher state. 
Express J in terms of K, and K and show that 


E, >K, —(K, — K?)'”. 


This formula, due to Weinstein, provides a lower bound to the ground 
state energy. 


2 Equation (4.26) gives the Hamiltonian for a linear harmonic 
oscillator. Using the normalized function 


20 i 
OT QP +x?) 


obtain the expectation value of the Hamiltonian. Use the variational 
principle to show that the best choice for a for the ground state is 
(i? /2mk)' and that the best value for (#) is7i(k/2m)!/2. 
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8 Consider the one-dimensional particle whose Hamiltonian is 


| Hz it 


Using the non-normalized trial function 
W(x) = ene x? /2 
show that the expectation value for the energy is 
ho? 3k 


Using the variational principle show that the best value for @ is 
(6km/ffi2)"’6 and obtain the corresponding value for the ground state 
energy. 

You may assume the integral 


} 6? dx = Jala. 


4 Suppose a system of classical particles is in motion. Newton’s second 
law states that x; = m,X; where x; is the force in the x-direction acting 
on the ith particle of mass m;. Confirm the expression 


Xi m 2,14 
2 dt 
Show that provided the particle is confined to a finite region of space, 

the time average as t > © of the last term on the right-hand side is zero. 


For conservative systems the forces acting on the particles may be derived 
from a scalar potential. 


(™m;x;;). 


3 
XE — 5 VO M1210. va): 


If V is a homogeneous function of the co-ordinates of degree n show 
that 

nas — 

—V=T 

2 


where V and T are the time averages of the potential and kinetic energies 
of the system as t > °°, 
(You may use Euler’s theorem 


This is the virial theorem of classical mechanics. 
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5 Letr, and rz be the position vectors of two points in space relative 
to some origin. A well-known result is that 


1 1 &{n\ 
——__ = — > (2) P,, (cos @) ny >r 


—-r 
Imp tel on i=6 


1 <x re 
=— > A P,, (cos 8) ry >r, 
rz pays Lo) 


where @ is the angle between r, and r. and P,, is the Legendre polynomia 
of degree n. Another result is that 


Pn (COs 8) = Pp (COs 81) Pn (cos 42) 


n m,n — Im)! 
ee 2 +m) 


Pr’ (cos 61) Pp” (cos 92) 


x cos (¢, ~ $2) 


where (6,;, 6,) and (0, $2) are the polar angles corresponding to r, and 
rz respectively. 
Hence show that if 


3 
viewm=+(5] exp exe +1)| 


then 


e? 1 3 
© | y* ——_ dr, dry = >a ——_.. 
4n€q { Iry _ rq | v . 8 ag 4teg 


This is the result (7.24). You may assume that 
+1 
P,(u)du=0 n>0 
-1 
and that 
Po(u) = 1. 
6 Consider the eigenvalue equation (7.28). Substitute for y using 


(7.29). Premultiply the equation by x; and integrate and confirm that 
(7.35) is obtained. 


7 Confirm that the perturbed wave functions (7.58) obtained by 
choosing ¢; = 1 are orthonormal to first order. 
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8 Suppose the Hamiltonian # is a function of the parameter A 
(e.g. (7.45)). Let W,, be the normalized eigenfunction with eigenvalue 


E,Q)= | vt#n dr. 


Show that 
dEn (20H 
Dns vaS Un dr @ 


This is the Feynman theorem. Note that y,, will depend upon A and that 
a 
i [Vain dr=0. 


In the case of perturbation theory 99//0A = & (7.45). By expanding 
both sides of (i) in powers of \ show that this equation is consistent with 
the perturbation theory results (7.54), (7.58), (7.59). 


9 The Hamiltonian operator describing the Helium atom is given by 
(7.17). This may be regarded as an unperturbed system with Hamiltonian 


2 2 
t=! +33 gala a 
m 0 


and a perturbation 
oy ogee sees 
4nég Ir, _ rl - 


Show that to first order, the energy of the ground state of the 
perturbed system is 


=e (ul 
4n€gay \ 4) 


(Compare this with (7.27).) 


10 A perturbation ax* is applied to a one-dimensional simple harmonic 
oscillator (4.26). Show that the first-order change in the energy is 


3(2n? +2n + 1)h?a/m?w?. 
You may use the recurrence relation 


2yHn(y) = 2nHy_1(¥) + Anti (y) 


where H,,(y) is the Hermite polynomial of degree n, and also the 
orthogonality relation 
fo Hn) (9) dy = 2"! V8 an: 


~—oo 
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11 The Schrédinger equation for a plane rigid rotator with a uniform 
electric field & applied in the plane is 


—-r 2 (Ete @cosd)y =0 


where J is the moment of inertia, y is the electric dipole moment and ¢ 
is the angle of rotation. Regarding the term —u & cos ¢ as a perturbation, 
show that the energy, correct to second order is 
a? Ip? €? 
— m? +b 
21 n“(4m* — 1) 
Find an expression for the polarizability and deduce that the dipole is 


parallel to the field when m = 0. Comment on the results for the states 
m = +1. (Hint: remember this is strictly a degenerate problem.) 


m=0,+1,42,.... 


12 Show that the eigenfunctions for a free-state electron moving along 
the x-axis, and satisfying the periodic boundary conditions 


¥@&+L)= v(x) 


are 
1 Qn 
x) =—= exp|il—x Zinteger 
vx) VE. >| Z | i 
with energies 
h?1 
E°)=—. 
O-. 


Suppose a periodic potential 
Vx) =V@ +a) 
with 
L=Na N integer 


is introduced as a perturbation. If the mean value is zero 


2 
V@)= > Vea with Vi=—V_y. 
n#0 
Show that to second-order terms, the perturbed energy is 


IV,,|? 


EQ) =E%Q)+ > E°() —E°- nN) 


n#0 
Explain why this result is not valid if 7 =nN/2. 
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In this case show, that to first order 


= (2) -s2(2)erra 


13 The ‘phase’ of the wave-function (7.98) is 
x 
o= j k dx —1/4. 
a 


Explain why the quantization condition (7.101) implies that the wave- 
function has n and only n zeros, (nodes). 

By considering the quantization rule (7.102) for adjacent levels of 
large n and using the classical relation 


where V is the velocity and E the energy, show that the difference 
between the neighbouring levels is 


bE =Tiw 
where w = 2n/T and T is the ‘period’ of the motion. 
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CHAPTER 8 


Time-Dependent 
Perturbation Theory 


8.1 Variation of Constants 


The time-independent perturbation theory was discussed in the previous 
chapter. Then the object was to find the changes in the eigenvalues and 
eigenfunctions of the stationary states of a system when a constant 
perturbation was applied. Suppose now that the perturbation is a function 
of the time. In this case the perturbed Hamiltonian operator is time- 
dependent and stationary states do not exist. The problem is now to find 
the time-dependent state function. 

Consider a system where the Hamiltonian can be written 


KH =H? +r#Rt1) = O0<dAK1 (8.1) 


where #” is independent of time and 4(f) is a function of time. It is 
assumed that the energy associated with 4(r) is small compared with 
that associated with #9. As before the parametric value \ = 1 corre- 
sponds to ‘complete’ perturbation and A = 0 to zero perturbation. 
2° must be chosen so that its eigenfunctions are known. Let the 
orthonormal set {x,} be these time-independent eigenfunctions. 


HN = ERX K- (8.2) 


H° is assumed to possess a discrete spectrum. 
The general solution of the unperturbed time-dependent wave 
equation 
A ay°® 


HY? = —— 
i ot 


(8.3) 
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|can be expressed as a linear combination of the variable separable 
solutions (3.15), i.e. 


Vo = 2 a, exp [—iE2 t/t] xx. (8.4) 


The a;,’s are constants. 
The perturbed time-dependent wave equation is 


(#° Aye = — 25% (8.5) 


At any instant, Y can be expanded as in (8.4) but the values of the 
expansion coefficients will no longer be constants in time. 


v= Zax (0) exp [—iEpt/hl xx. (8.6) 


Substitute (8.6) into (8.5) and with the help of (8.2) it is easily shown 
that 


da 
x Xan h eB Kelty = -_2 i ae k e HE Rtimy 


+ 70 
Multiply this last equation by e'“it/" y* and integrate over the space 
co-ordinates to obtain 
h da 


d DaghpetEr- Ewe = — — 
k 


1=1,2,... 8.7 
i dt (8.7) 


where 
h(t) = [xt Axx dr. 


The equations (8.7) are the basic equations of the method of variation 
of constants and determine the expansion coefficients. However to solve 
these equations it is necessary to use perturbation theory. As in the 
previous chapter (7.49) it is assumed that the wave-function is a con- 
tinuous function of A so that it is possible to expand a, as a power series 
ind 


ay =a tra +... (8.8) 


Equation (8.8) may be substituted into (8.7) and the resulting 
equation regarded as a power series in A. The zero-order term gives 


daf® 


ih a 


=0 1=1,2,... (8.9) 
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| 
showing that the zero-order expansion coefficients are constants. The 


first-order term gives 


(1) 
ih da; 
dt 


= % af My ft ERI IBA es (8.10) 


The general power A” yields the higher order perturbation coefficients 


da™ 
ara 5 a"—Dny efEi-Etmh =p =1,2,.... (8.11) 
k 


The zero-order coefficients, (8.9) do not change with time and their 
values are specified by the initial state of the system. Suppose the system 
is in the ith stationary state of the unperturbed Hamiltonian when the 
perturbation is introduced at ¢ = 0. Then 


a™=1 a=0 f#i (8.12) 
The first-order changes in the expansion coefficients are then obtained 
by solving the equations 
da 4) 0 
ih sap =hyelEr-Epem f= 1,2,.... (8.13) 
The solutions satisfying the initial conditions af?) =O are 


t 
i 
a\)(t) = -F [ane el(EZ-Eit/M ap = f= 1,2,.... (8.14) 
0 


In the simple case that the perturbation does not depend explicitly on 
the time but is switched on at time ¢ = 0 and switched off at ¢ = t’, the 
solutions of (8.14) are 


ihe el(EF ~E})t Mm 


a(t’) =hy Be =o fFi 
i 
Oey") = hi, 
a; A ala cone (8.15) 


These values remain unchanged for t > ¢’ after the perturbation is 
switched off and the wave-function which for t <0 was x; exp -iE?t/h] 
is 


v= [1 +aD(1')] eH thy + > al (ee EFM, t>t! 
fHi 


where the coefficients are given by (8.15). If the set {x ¢} is orthonormal 
then this wave-function is normalized to first order. 
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The product a$)*(1)a ap (0) gives the probability of finding the system 
in the state x ¢ exp (—iEft/h) after a time ¢. 


la? = lag jp Er Ee egy (8.16) 


: EP\: 0<t<?'. 
(A 


The system may be considered to make transitions into all possible 
states. The expression (8.16) is periodic in time and the time At of a 
transition is of the order of h/(Ef — E?). Strictly EP is not an eigen- 
value of the perturbed Hamiltonian but if the perturbation is small 
enough it t may be taken as a system eigenvalue. If the energy change 
(EP — E?) is significantly different from zero the time spent in the upper 
state is small. This is to be expected as in such transitions energy is not 
conserved and the change in energy AE = (EP ff — — E?) and the time 
available for measurement must be consistent with the uncertainty 
principle, 


(AE)(At) ~ h. (8.17) 


Energy is conserved within the limits set by the uncertainty principle. 

At time ¢ > t’ after the perturbation is switched off the probability 
of finding the system in the exact eigenstate x exp (-iE? 'f t/h) is simply 
(8.16) with t replaced by t’. If the matrix element hy; is zero the tran- 
sition from the ith state to the fth state cannot take place by a ‘first- 
order’ process. However, if hj: and hg are both non-zero then this 
transition may take place by a ‘second-order’ process through the 
‘virtual’ state 7. The details of this depend upon the second-order change 
in the wave-function and will not be dealt with here. 

The expression for lafPe’) |? may be written 


0 0 

| a y= Hak sin ae) wank _GF =Ei) 
x? 2h 
The function sin? aku is sharply peaked about x = 0 with zeros at 
xt’ =ina,n=1,2,.... This is another expression of the result that 
energy is conserved i in transitions, the most probable transition being 
those for which E? ~ Ef. If the perturbation is applied for a very small 
time ¢’ then ae |? is proportional to t'? and the transition prob- 
ability per unit time is not a constant. This undesirable result is avoided 
if it is assumed that there is a continuous distribution of states. 
Suppose it is desired to find the probability that a system originally 

in the ith state will finally be found in the continuous range 


sf of 


attics SS fa 
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The probability of finding the system in one of these states after the 
perturbation is switched off is | 


flaPe>P a (8.18) | 
af 
f is a continuous variable (eigenvalue) that defines the state uniquely. 

This can be written 


foe Pyare? ak, (8.19) 
6E 


where p(E, ? ) is defined so that p(E?) dE, is the number of states in the 
energy range E ‘ + dE, and the range of integration 5£ corresponds to 8f. 

For ft’ sufficiently large (so that 5E > h/t’) the transition probability 
has the properties of a delta function (see equation (3.36)) and (8.19) 
may be written 


2nt' 

AE | Ina oeP IEP ~ EP) aE, =" tli lo). @.20) 
bE 

This result assumes that the energy range 5F includes E? and that p(EP) 
is a slowly-varying function. This probability is linear in time and the 
transition probability per unit time P, that the system undergoes a 
transition from the initial state i to one of the final states is 


20 
== pEP hp? (8.21) 


P is a constant as expected and equation (8.21) is Fermi’s golden rule. 


8.2 Semi-Classical Treatment of Radiation 


In this section the interaction between a material particle and electro- 
magnetic radiation will be discussed. The particle will be assumed to 
obey quantum mechanics and the radiation to satisfy the classical Maxwell 
equations. Clearly such a semi-classical hybrid of quantum and classical 
mechanics cannot be completely satisfactory and the radiation field should 
be quantized and described in terms of photons. Such a treatment is 
beyond the scope of this book and the interested reader is referred to 
Introductory Quantum Electrodynamics, by E. A. Power. 
Classically, an electromagnetic field can be described by the scalar and 
vector potentials ¢ and A with 
dA 
B=curlA and &=—grad ¢ — an ; (8.22) 
where B and E are the magnetic induction and electric field vectors 
respectively. The vector field A can be resolved into an irrotational and 
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a solenoidal part. The first of equations (8.22) defines the solenoidal 
| part only and in free space the vector A can always be chosen to satisfy 
| 1 a¢ 

divA+ >> = : 
iv arr 0 (8.23) 
'¢ is the velocity of light. 
It has already been shown (7.73) that the Schrédinger equation for an 

, electron of charge —e and mass m in an electromagnetic field is 


h? 242 
— 5 V? = ihA. grad — =~ ih div A + ~eo|¥ 
_ ay 
=n (8.24) 


More than one set of scalar and vector potentials describe the same 
electromagnetic field. If x is a solution of the wave equation 


ay 
Vx - 3573 =0 (8.25) 


then new potentials 
Ul Ul a 
A’ =A — grad x g=o+> (8.26) 


can be defined which still satisfy (8.22) and (8.23). Such a transformation 
is called a gauge transformation. In regions where there is no free charge 
the scalar potential must also be a solution of the wave equation (8.25) 
and in this case x can always be chosen so that the scalar potential is zero 
and from (8.23) div A = 0. 

With this choice for A and ¢ and if the field is weak so that the term 
in A? can be neglected, the Schrédinger Hamiltonian becomes 

h? ee 
on Vv = ihA. grad. 

If the electron is bound to an atom it is necessary to introduce an 
additional potential energy term V to represent the central field and it is 
then appropriate to use the reduced mass y for the electron. The 
Hamiltonian is then 


#2 — 57 VP + V Tina. grad. (8.27) 


This Hamiltonian can be written 
H =H +h 
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where #° is the Hamiltonian for an electron in a central field and 4 is 
the perturbation operator 
é, e 
4 =-——ihA.gad=+—A.p (8.28) 
We u 
where p is the operator representing the linear momentum of the electron 
For a monochromatic plane wave, the real vector potential is 
A= (Ao eit) i(k .r—wrt) + (Ao ee ere) (8.29) 


where Ag is a constant real vector and w is the angular frequency. To 
satisfy div A = 0 it is necessary for 


Ag-.k =0. 


That is, the plane of polarization of the wave is perpendicular to the 
direction of propagation. 

The expansion coefficients can be found using the result (8.14). If the 
perturbation is switched on at f = 0 when the system is in the unperturbed 
atomic state i then to first order 


| ' eilwy —w)t =] . elo t wt asf 


hy fi (@pto) | (8.30) 


(wy — w) 


e “4 
hp | xpe®F Ag. px; dr 


ey -i “ 
hn= 7 e ml xge WRT AY py, dt 


and 
wpa = (Ef — Ep yh. 


If (coi + w) is not close to zero then a(t) oscillates very rapidly and 
the time spent in the state f is very small. Following the argument from 
(8.14) to (8.16) the probability of finding the system in the state f at a 
time ¢ is 


-2 2 
sin® (wy — w)t/2 +w)t/2 
POP sing? aN 5 ing 2 te ns one 
(2a—*) (228) 
2 2 


+ (product terms). (8.31) 


In all cases the product terms in the expansion for |a#(t)|? are small 
and can be neglected. 
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The probability of finding the system in the state f at the time f is 
appreciable only if the frequency of the incident radiation is such that 


Wy — 0 ~0O or wy tw =O. 


| These are the cases of resonance. At most one of these conditions is 
satisfied for a given frequency and two given energy levels. Transitions 
' between the states i and f are probable only if 


Ef ~ EP = thw. (8.32) 


The positive sign applies when the particle absorbs the energy Aw from 
the field. If (8.32) is satisfied with the negative sign the particle loses 
energy fiw to the field. 

In atomic spectra the initial and final states are discrete and the 
transition probability per unit time would appear to be time-dependent 
(c.f. (8.16)). The difficulty is overcome by remembering that for a light 
source the radiation is produced by the random vibration of atoms that 
are widely-spaced. In practice the radiation is not strictly monochromatic 
but covers a small but finite range of frequencies 

bw 5w 
w 5 <wcwt 7? 
the frequencies having random relative phases. In this case interference 
effects can be neglected and the contributions from the different 
frequencies are additive. 

Consider the case of absorption where energy is absorbed from the 
field. In this case wyj ~ w and only the first term in (8.31) need be 
considered. The total probability is obtained by integrating this term 
over the frequency range 5w. The delta function properties of (8.31) 
give the probability for absorption (upward transition) to be for t 
sufficiently large, 


J aS (t)I? deo = 2nt [jz [5 (wp; — w) deo. 
Sw bw 


The integral is independent of time and so the transition probability per 
unit time is a constant. 
P=2n | hyp |? 5 (cog — w) dew. 
bw 


The magnitude of the vector potential Ag is a function of the 
frequency and this integral can be written 


2ne? * a pe 2 
P= pi Adwn)| |xr e* "box; dr (8.33) 
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where fo represents the momentum component along the direction Ag. 
The use of the delta function is equivalent to assuming that A3(w) is a 
slowly varying function of w. 

The transition probability depends upon the intensity of the radiation 
at the ‘resonance’ frequency. At any time instant the rate of flow of 
energy per unit area is given by the Poynting vector 


N=8AH. (8.34) 


For a plane wave in free space 


M=, [6 


where E is the magnitude of the electric field. The value of |N| associated 
with the frequency w and averaged over one oscillation of period 27/w 
is (8.22), (8.29), (8.34) 


2 
a A3(w) = E(w) 


where E(w) is intensity of radiation at frequency w and Mg is the 
permeability of free space. From (8.33) 


2 
_ MUoce ik. ™ 
p=+=-> E d 
went (wp)| [xfe Pox: dr|” (8.35) 


where EP ~ EP thw. 
Consider now the case of emission where the energies are such that 
Wri = —W, i.e. 


EE? ~E? — hw. 


The second term in (8.30) is now dominant and the probability for the 
downward transition from i to f is simply (8.35) with e’*-* replaced by 
e—'k-¥, This is the result for emission induced by the presence of a field. 
Note that not only is the absorption probability proportional to the field 
intensity as expected but that the stimulated emission probability is also 
proportional to the intensity. Also the probabilities for induced upward 
or downward transitions between a given pair of states in a given field 
are equal and ee re to the integral 


| | ute "Box: dr|’. (8.36) 


This is an example of the principle of detailed balancing. Jn an upward 
transition a photon (a quantum of radiation energy) is absorbed from the 
field and in a downward transition a photon is emitted by the particle. 
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ne electric-dipole approximation 
the exponential in (8.35) can be expanded 


2 
| g@k-r =] ike EP $25 %. (8.37) 


| 
[he wave-length of visible radiation is much greater than atomic dimensions 
ind i in the vicinity of the atomic system where the wave-functions x and 
(; are appreciable, k .r is much smaller than one. 

In the dipole approximation (8.35) is evaluated with e'* -* replaced by 
unity and the induced probability per unit time is proportional to the 
square of the integral, 


[xf Box: dr. 
This approximation will not be valid for X-rays. Following an argument 
similar to that used in Chapter 5 (5,70) it can be shown that 
eo _ 4d * 
|xfeox. dr= my | Xrroxi dr 
where ro is the component of the electron displacement r, relative to the 
origin, parallel to the direction of polarization. (—er) is the electric 


dipole moment of the electron. From the generalization of (5.47) 
(question 4, Chapter 5) 


d 
a sy |xtroxi dr= “(Ef ~ EP) i} XFToX: At. 
Finally, the integral in (8.35) becomes 


if XF Pox: AT = iNwy; J XF Tox: ar. 


The transition probabilities per unit time for absorption and induced 
emission are now 


ioe THoee” won| [xt XfToXi dr|”. 


If the radiation field is isotropic this becomes 


Tipce” 
3h? 


E(wy)| \xdex: dr” (8.38) 


where r = ix + jy +kz and E(w,;)/3 is the intensity in the x, y, z 
directions. The coefficient of the intensity in this formula is the Einstein 
B-coefficient of absorption or induced emission. 
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If the dipole integral f XFIX ; 7 is zero for a pair of states the tran- 
sition probability vanishes in this approximation and the transition is 
said to be forbidden. In fact the transition probability may not be 
exactly zero as there may be contributions from the other terms in the 
expansion (8.37). 

If the expression (8.35) vanishes the transition is said to be strictly 
forbidden. Even in this case, however, the transition may take place by ; 
second-order transition. The reader is referred to Quantum Mechanics, b 
Schiff, p. 254, for a deeper treatment. 


Spontaneous emission 


Even in the absence of an external field there is a chance that an atom in 
an excited state with energy £; will make a transition to a lower energy 
state Ey with a corresponding emission of radiation energy. It can be 
shown (Schiff, Power) that the transition probability per unit time for 
such a spontaneous emission is again proportional to the square of the 
modulus of the dipole integral, i.e. 


| fxtexi ar’. 


Selection rules 


The conditions which the functions x; and x¢ must satisfy to prevent th 
dipole integral (8.38) from vanishing are called ‘selection rules’. Note 
that if both functions are spherically symmetric (atomic-states with / = 0 
the integral vanishes and the transition is forbidden. Similarly, using the 
hydrogen-type wave-functions it can be shown that the electron transitic 
is ‘forbidden’ unless the changes in the / and m; quantum numbers satisfi 


and 


5m, =0 or +1, 


This is dealt with in more detail in Chapter 11. 

In Chapter 6, problem 14 it is shown that the matrix elements of Xp, 
with respect to the harmonic oscillator eigenfunctions are zero unless 
n=m +1. This implies the selection rule 


so that the spectrum should consist of a single frequency equal to that o 
the oscillator. This has already been mentioned in Chapter 4 with referer 
to the vibrations of diatomic molecules. 
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Line breadth 

The uncertainty principle can be expressed in the form 

(At)(AE) > h/2 


where (Afr) is the uncertainty in the time measurement and (AF) is the 
uncertainty in the energy. 

The probability per unit time \ for a transition from a state i toa 
lower state f of a system is independent of the time. Such a time- 
_independent process obeys an exponential law of decay and 1/) is of the 
order of magnitude that the system stays in the upper-state. Clearly a 
determination of the energy of this upper state cannot take longer than 
the lifetime 1/A of the state and so the uncertainty in the energy is 


(AE) > A/2. 


This uncertainty in the energy corresponds to a broadening of the 
emitted spectrum line (uncertainty in the angular frequency of 4/2) and 
in general, the longer the lifetime of the state the sharper the spectrum 
line. This broadening imposed by the uncertainty principle is very small 
and is of the order of that observed although there are other factors such 
as radiation damping which affect the observed line width. 


8.3 Elastic Collisions 


Scattering cross-section 


Suppose a parallel stream of non-interacting particles impinges upon a 
fixed target. Let the density of incident particles be such that n particles 


f 
aa} 


Incident Target 
™ Incident 
beam 
Figure 8.1 
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cross a unit area perpendicular to the flux in unit time. The incident 
particles will be scattered due to the interaction with the target and the 
number of particles scattered into an element of solid angle 62 will be 
proportional to 5Q. The constant of proportionality o has the dimen- 
sions of area and is called the differential cross-section. It is not a 
constant but depends upon the polar co-ordinates (0, ¢) of 59? relative 
to the incident flux. 

o(6, @) is the probability per unit time for a particle to be scattered 
into unit solid angle about (6, ¢) when a beam of unit flux bombards the 
target. The total cross-section is defined to be the integral over the solid | 
angle 


Qn tt 


{o@,¢)ar= { { o@, 4) sine 40 ag. 
00 


Quantum theory of scattering 


The result of a collision in classical mechanics is determined by 


(a). the initial particle velocities; 

(b) the distance at which the particles would pass if there was no 
interaction; 

(c) the interaction potential between the particles. 


Since the concept of particle trajectory loses its meaning in quantum 
mechanics a different approach is required. 

The quantum mechanical Hamiltonian representing a system of two 
particles whose interaction depends only on their relative displacement 
commutes with the operator representing the total linear momentum of 
the system. So the expectation value of the momentum in any state that 
satisfies the Schrédinger equation is a constant. Linear momentum is 
conserved in collisions in both quantum and classical mechanics. 

In Chapter 4 it was shown that if the potential in a two-body problem 
depends only on the relative displacement of the particles the Schrédinger 
equation can be split up into two, one of which describes the motion of 
the centre of mass and the other describes the motion of a single particle 
with reduced mass p in the field due to a fixed centre of force. 

The scattering problem can be reduced to a single-body problem by 
choosing the centre of mass co-ordinate system in which the centre of 
mass is always at rest. Experimentally a more obvious system is one in 
which the target particle is initially at rest. This is the laboratory co- 
ordinate system. The differential cross-section is not the same in both 
systems. However the total cross-section is obviously the same in both 
systems as it refers to the total number of particles scattered in all 
directions. 

The centre of mass system is used below. 
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Consider a collision in which an incident particle is scattered by a 
target. If the ‘internal energy’ of the two particle is left unchanged, the 
collision is said to be elastic. In the centre of mass system, the problem 
is to solve the equation 


2 
_ on V7V(r) + VW) = EVs) 


where £ is the energy of the relative motion. The precise form of the 
scattering potential V(r) may be quite complicated and consequently 
most collision problems cannot be solved analytically. However it is not 
difficult to show that at large distances from the scatterer the wave- 
function will take the form 


- elkr 
en + £0) — 


The first term represents a particle incident along the positive z-axis 
whereas the second term represents the scattered wave. The problem is 
to find f (6, ¢). 

Collision problems can be solved by time-dependent perturbation 
theory. In this treatment the short-range interaction is regarded as a weak 
perturbation which is zero initially and decreases to zero after the 
collision. Equation (8.21) will now be used to evaluate the transition 
probability when the initial and final states of the scattered particles are 
free-particle momentum eigenfunctions. 

The time-independent part of the wave-function of the incident 
particle is, when normalized to give unit incident flux 


u i 
Xi= lk exp (ikg .r) (8.39) 


This represents particles with momentum hk. The wave-function of a 
scattered state is, using the 5-function normalization for a continuous 
spectrum, 


Xk = rs exp (ik .r). (8.40) 


For elastic collisions the internal energy of the particle is left 
unchanged and 


Ko =k. 
The change of momentum of the incident particle is 


h(k — ko) = —AK. 
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The perturbation matrix element corresponding to the initial and final 
states (8.39) and (8.40) is | 


—a | 
hy = Tce | VO exp OKs) ar (8.41) | 


where the potential V(r) is due to the interaction between the incident | 
particle and the scatterer. This integral depends in general on the direction 
of scattering. The probability of finding the scattered particle with wave ' 
vector in the range k + 5k and in the solid angle 62 = sin 0505¢ after 
time f’ is (8.18) 


(1a er') 2a? ak aa. (8.42) 
5k 


Since E,, = h?k? /2p then (8.20) this can be written 
(Ag 2 pk 2n/h?)6Q2'. (8.43) 


Hence the number of particles scattered into the solid angle 6Q per unit 
time is ((8.41), (8.43)) 
2 


aaat | J Viryel®-* dr)"62 


and so the differential cross-section is 


2 
Be Gems iK. 2 
o(8, ¢) = ( =| | {veel -* ar]. (8.44) 

This is the result of the first Born approximation and will be valid if 
the interaction potential V(r) is small compared with the kinetic energy 
of the incident particles. 


Figure 8.2 
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The simplest scattering potential is that of the spherical well 
Vir)=-Vo r<a 
=0 r>a. 


In this case the polar axis can be conveniently taken along the 
momentum transfer direction K. 


[v@ex dr=Vpo iff eiKr cosB,2 sin B dy dB dr 
0 


a 


Vo | sin (Kr)r dr 
0 


K can be expressed in terms of the scattering angle 
K = 2g sin 6/2. 


The differential scattering cross-section is 


2pVoa>\? [ sinu — u cosu\? 
oc = (HE) ics aka ld 


where 


) 
pa ge 


Figure 8.3 
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Figure 8.4 Differential scattering cross-section for a spherical potential-well. 
This extremely brief treatment of the theory of collisions has been 

given primarily to illustrate the time-dependent perturbation theory. 

The reader is referred to Burhop’s review article ‘Theory of Collisions’ 


in Quantum Theory, Elements, edited by D. R. Bates, for a fuller 
treatment. 


PROBLEMS 


1 In the laboratory co-ordinate system the stationary target mass 7712 is 
initially at 0 and the projected mass m, has a velocity (0, 0, V). If after 
the collision, assumed elastic, the particles are scattered through 6, and 
62 with velocities V, and V2 (see Fig. 8.5) show that 


my y? =m,V2 +m V3 
m,V=m,V, cos6, +m zV2 cos 9 


mV, sin 6, =mV2 sin 85. 


Figure 8.5 
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Show also that the centre of mass moves with a velocity (0, 0, V9) where 
Vo = m,V/(m, +m). 

In the centre of mass system the total linear momentum is zero and 
the momenta of the particles are the same in magnitude but opposite in 


Figure 8.6 


sign. In Fig.8.6, C is the fixed centre of the mass, u,, u2 are the initial 
velocities and V;, V2 are the final velocities. Explain why for elastic 
collisions Vj; =u, and V =u, and show that 


V, sin @, =u, sin @ 

V; cos a, = ty cos @ + Vo. 
Hence show that 

my sin 0 
tan 0, oS and 6, =4(n — 84). 
m, tmz cos 6 

(Hint: observe that V, is the resultant of V; and the centre of mass 
velocity (0, 0, Vo).) 
2 Let cand o’ be the differential cross-sections in the laboratory and 
centre of mass co-ordinate systems respectively. Then 

0(6,) dQ = 0'(8) da’ 
where dQ and d9Q’ are the corresponding respective solid angles. Show 
that (see problem one) 


(1 +41c0s 8 
ELS eT 


sin 0, dé, (4p? + Iu cos 6)? +e + 2g cos 6)” 
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| 
where uf = m,/m, and hence show that 
| 
(1 +p? + 2u cos 0)>/? 


0(6;)= o'(8) If 442-008 61 


3 Maxwell’s equations for the electromagnetic field are 


curl @= — = curl H=j +52 
divD=p divB=0 

with 
D=€,€98 B = u,>UoH 


where €,, U, are the relative permittivity and permeability of the medium. 
Show that the electric and magnetic induction vectors can be expressec 
in terms of the scalar and vector potentials ¢ and A such that 


) 
= — grad d, B=curlA (i) 


with 
2 
2g eer O_o 
VO— oF €,€o i) 
and 
Myé, 7A e. Oa 
v2a — Beet oF = ured (tt 
provided 
; Mr€, O¢ 
divA=-——> —. 
ss c? at 


More than one set of potentials specify the fields. Show that equations 
(i), (ii) and (iii), are invariant under a change to the new potentials 


, t ox 

= — = + 

A =A -— grad x, = at 
provided 


— Mr€r Ox _ 


2 
0. 
vx c? ar? 


This is called a gauge transformation. 
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4 (i) Prove the vector identity 
div (@A H) =H. curl — &. curl H. 
(i) Write down Maxwell’s equations for free space (problem 3) and show 
at 


0 
H.curl @-— &. curl H= — 57 GHoH® +4698). 


The energy density of the electromagnetic field is 
duoH? + hegF. 


Use the divergence theorem to show that the flux of energy out of a 
closed surface s is 


§N.nds 
Ss 


where n is the unit outward normal and N is the Poynting vector 
N=@AH. 


5 The commutator for two quantum operators f, A is related to the 
corresponding Poisson bracket of the two dynamical variables p, A by 
(3.6) 


[p, A] =ih{p, A}. 


Show that if p is the momentum operator and A is a function of the 
position only then 


[p, A] =-M divA. (i) 


(Hint: use equation (1.28).) 
The classical Hamiltonian for an electron in an electromagnetic field is 


2 
a, 


H= 
2m 


Lee Renee a7 cee 

2m °° PP 2m . 

(Chapter 3, problem 2). Explain how equation (i) can be used to obtain 
the correct quantum Hamiltonian of (8.24). 
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Group Theory 


9.1 Symmetry Operations 


Many physical systems possess symmetry properties which have 
important consequences for the occurrence of energy levels. For 
example, crystalline solids have a periodic structure and this translational 
symmetry enables the energy eigenfunctions to be classified by a wave 
vector k. These functions are the Bloch waves and will be referred to 
again later in the chapter. 

Although finite molecules cannot possess a periodic structure they 
can have other symmetry properties, such as reflection planes and 
rotation axes. 


Figure 9.1 Plane of symmetry. 
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A plane of symmetry is a plane in the system such that each point on 
one side of the plane is the mirror image of a corresponding point on 
the other. 


Mirror 
plane 
Phx _-*P 
~ - 
~ - 
~~ ~ ” ee, — . 
Seo Se Tae hee TTL Inversion 
ao axis 


Figure 9,2 


An axis of symmetry is an axis such that the system takes up an 
identical position on rotation through a suitable angle about it. The 
angle of rotation must be of the form 27/n where n is an integer and 
such an axis is called an n-fold axis. 

A centre of inversion is a point such that the system is invariant 
under the operation r > —r where r is the vector displacement of any 
point in the system referred to the inversion centre. 

A system possesses a rotation-inversion axis if it is brought into self- 
coincidence by a rotation followed by an inversion. 

Some symmetry operations are equivalent. For example, a two-fold 
inversion axis is equivalent to a mirror plane perpendicular to the axis 
(Fig. 9.2). 


Symmetry operators 


Mathematically, a symmetry operation such as a rotation or reflection 
involves a linear transformation of the co-ordinates. There are two 
distinct ways of thinking about such a rotation. A physical rotation of 
a body through an angle a in the clockwise direction about an axis, say 
the x3 axis, is equivalent to a rotation of the complete axes system 
through the same angle but in the anti-clockwise sense in that the 
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relation between the new and old co-ordinates of any point in the body 
is the same in both cases (Fig. 9.3). In each case 


, , . 
X, =X, cosa —X, sina 
, . ‘ 
X_ =X, sina +x cosa 


x3 = x4. (9.1) 


where x1, X2,x3 and x}, 3, x3 represent the old and new co-ordinates 
respectively. The first point of view in which the system is imagined to 
undergo an actual rotation is called the ‘active’ viewpoint and is perhaps 
the easiest to interpret. However, mathematically it is far more convenient 
to use the ‘passive’ viewpoint of axes rotation and this will be done here. 


(a) = 


Figure 9.3 (a) Rotation of Pto P’ ina clockwise sense. 
(b) Rotation of axis in anti-clockwise sense. 


A general rotation, reflection or inversion can be represented by a 
linear relationship of the form, 


, 
Xy = yyXy +Qy2X2 +4 43x3 
, 
XQ = Aq1Xy + Ag9X2 + A23x3 
, 
X3 = QgyX1 +32X2 +033x3 (9.2) 
. . , , ’ 
where x1, X2, 3 are the old co-ordinates of a point and x1, x2, x3 are 


the new co-ordinates of the same point. Equations (9.2) can be 
represented by the simple matrix equation 


x’ =ax (9.3) 
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| where @ is the matrix 

Oi 12 13 
B=] 91 M22 M3 

C31 032 33 


and x and x’ are the column vectors 


*1 x4 

x2 and x2 

x3 x3 
respectively. 


For an inversion, a rotation about an axis, or a reflection in a plane 
through the origin, the distance of a point from the origin remains 
constant. 


V t 
=x? 4x3 4x3 =x7 tx? +xP. (9.4) 


The matrix @ is an orthogonal matrix and its determinant takes the 
value +1, the negative sign belonging to an operation involving an odd 
number of reflections. 

The effect of any operator on a function f(x) is to transform it into 
another function. For example the differential operator d/dx changes 
x” into nx"—!. It is now necessary to define the effect of a symmetry 
operator A on an arbitrary function. In the passive convention, the co- 
ordinate axes are rotated and the value of the function at each point in 
space remains unchanged. The transformed function is easily obtained 
by replacing the old co-ordinates by the new co-ordinates according to 
(9.3), i-e. 


Af (x) = f@™'x’) (9.5) 
where o is the transformation matrix corresponding to the operator A. 
It is convenient to drop the dash and 

Af (x) = f(a x). (9.6) 


The operator is a substitutional one and is linear. 
As an example consider the two functions 


di(x, y)=2xyf(7), dale, y) = @&? - y?) FO). 9.7) 
The operator representing a rotation of the axes through 7/4 radius in the 
anti-clockwise about the z-axis may be written 8,. The old co-ordinates 
are given in terms of the new by (9.1) 


eek 


1 , t t ft wt 
eo y V5 & t¥)s zEzZ 


x= 


1 -1 0 
1 

a} =— 1 0 
V/2 

0 1 


8, d(x, vy) =2 x4’ ~ yO’ ty’) 
=? —y") f(r’) 
=do(x',y’) (9.8) 


The function d, evaluated at the point (x, y) is equal to the value of dy 
at the same point with new co-ordinates (x’, y’). 


9.2 Groups 


A group is a set of mathematical elements {A} which obey certain 
specified relations. In this book the elements will be taken to be sym- 
metry operators but the concept is much more general. The members 
of a group have the following properties. 


Figure 9.4 
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[, Closure property 


it must be possible to combine the elements in pairs to form a new 
slement of the group. That is, the group is closed with respect to some 
(binary) operation, e.g. 


A=BC (9.9) 


In this book the group elements are transformation operators and 
‘9.9) means that if the operator C is applied first, and B second, the 
result is equivalent to applying the operator A alone, where A is another 
group element. In general the order of the operators is most important. 
Consider a point P on the y-axis together with a plane of symmetry m, 
in the yz plane and a four-fold rotation axis lying along the z-axis. 

If the operator m;, is applied first, followed by an active-clockwise 
rotation 4z, the point P moves to P’. However if the order of the 
operations is reversed, P moves to P”. The two operations do not 
commute. 

However, any two rotations about the same axis do commute. 


2. Associative property 


The group operation is associative and the combination of three 
operators is uniquely defined, i.e. 


A(BC) = (AB)C = ABC. (9.10) 


3. Identity 


One of the group elements is the identity or unit element E such that it 
commutes with all other members of the group, i.e. 


EA=AE=A. (9.11) 


4. Inverse 


Each element A in a group has an inverse A~ so that the combination 
of an operator with its inverse, results in the unit operator, i.e. 


AA =A TA=E. (9.12) 


The group 3m 


Consider a right pyramid with an equilateral triangular base and its 
central vertex above the centroid of the base. The pyramid is not a 
regular tetrahedron so that the three faces meeting at the central vertex 
are isosceles and not congruent to the base. This is the structure exhibited 
by the ammonia molecule NH3 with the three hydrogen atoms situated at 
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LI | 
I The origin is at the 
ms centre of the triangle. 


Figure 9.5 The ammonia molecule. 


the vertices of the equilateral triangle and the nitrogen atom at the 
remaining vertex. The group 3m consists of those symmetry elements 
which leave this pyramid invariant. This is also the symmetry group of ar 
equilateral triangle if the two plane sides are not identical. 

There are six symmetry operations which leave this system invariant. 
They are, 


(a) the identity operation E. 

(b) the three mirror planes of the type m. 

(c) the two rotations 3,, 32 corresponding to an active clockwise 
rotation of 120° and 240° about the z-axis respectively. 


These six operations may be shown to form a group, and Table 9.1 is 
the ‘multiplication table’ for the group. The fact that such a table can be 
constructed shows that the elements must form a group. 


Table 9.1 Multiplication table for the group 3m. 


Applied 
second 


In Table 9.1 the elements in the top row are applied first and then the 
operation in the first column. For example, if the reflection m, is applie 
and then followed by the rotation 3,, the result is equivalent to the 
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reflection m . However, if 3, is applied first and then 77,, the result is 
m3. This illustrates a point already mentioned that the order of the 
operations is important, as they do not necessarily commute. 


' The group 6m2 


Suppose in the previous example the two faces of the triangle were 

, identical. The system would then be invariant under twelve symmetry 
, operations constituting the group 6712. Apart from the six operations 
already discussed this group includes, 


(d) the three rotations of the type 2, through 180° about the vectors 
1, %2 and r3 respectively. 

(e) the mirror plane m, perpendicular to the z-axis and including the 
triangle. 

(f) the two operations 6, involving a rotation through 60° and 300° 
respectively, about the z-axis followed by an inversion through the 
origin. 

It is left to the reader to construct a multiplication table for this 
group. 

This symmetry group is of importance in that it is related to that of 
an ion surrounded by three molecules of water in the hydrated crystal 
of a salt. The ozone molecule has almost this symmetry. 


9.3 Matrix Representation 


The reader was introduced to the concept of a function space in Chapter 
3 (definition 6). A function space {6;} is said to be invariant under the 
set of operators {A} if all transformed functions of the type A6; belong 
to the space. The Cartesian vector space, defined by the unit vectors 
{é;}, is invariant under either of the two symmetry groups 3m or 6m2. 

If the space {@;} is invariant under the group {A } the transformed 
functions may be expressed in terms of the original set, i.e. 


AO; = 2, Omi9m: (9.13) 
m 
In matrix notation A® = a where @ is a row vector. The matrix a 
represents the effect of the operator A on the basis functions {0;} and 


each operator in the set has a matrix representation. If B is another 
operator in the set then 


BO=0R (9.14) 
The effect of the multiple operator AB is easily seen to be 


Jom 
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if the operators are linear, i.e. 
| 
AB6 = Oa8. (9.16) 


The matrix corresponding to the product AB is the product @® of the 
separate matrices representing A and B. 

The matrices obey the same group multiplication table as the | 
operators themselves and are said to form a representation of the group. , 
The order of the matrices gives the dimensionality of the representation. 

The set of functions {6 ;} can be used to define the same vector space 
as {0;} if the number in each set is the same and if the new basis functions 
can be expressed in terms of the old by some linear transformation, i.e. 


65 = 358i. (9.17) 
In matrix notation, 
6’=0S (9.18) 
where 9’ and @ are row vectors. Conversely 
6=0'S". (9.19) 
The representation of the operator A in the new basis is given by 
A@’ = A@S 
= @aS 
=6'S aS. 


That is, the new basis functions transforms according to the representation 
Sas. (9.20) 


This new representation obtained by this similarity transformation is said 
to be equivalent to the original representation a. A change of basis 
functions, as described above, is equivalent to choosing a new set of 
vectors {é;} in Cartesian space related to the old set {é;} by some rotation. 

The matrices in equivalent representations corresponding to a given 
operator are not in general identical but their traces are equal (6.22). As 
the trace of a product of two matrices does not depend upon the order 
of the factors 


Trace (S~aS) = Trace (SS~'a) = Trace (a). (9.21) 


Irreducible representations 


The function space {6;} is said to be irreducible under the set of operators 
{A} if with any function x; in the space, the set of transformed functions 
{Ax;} defines the same space where A runs over all the members of {A} 
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in turn. The Cartesian space {6;} is irreducible under the set of three 
operators consisting of a rotation of 90° about the x-axis, and 90° about 
y-axis and 90° about the z-axis. 

On the other hand, this Cartesian space is not irreducible under the 
set of operators consisting of the identity and rotations of + 120° about 
' the (111) axis. To see this, consider the unit vector along the (111) axis. 

A set of matrices representing a group of transformation operators 
with respect to a set of basis functions {6;} is said to form an irreducible 
representation, if the function space defined by {6;} is irreducible under 
the group of operators. A reducible representation can always be brought 
into irreducible form by an equivalence transformation of the type 
(9.20). 

The reducible matrices representing all the elements of the group are 
then reduced to block form, all with the same block structure. That is, 
each matrix is a direct sum of the square matrices a;, a2, .. . along the 
leading diagonal and the dimensions of corresponding submatrices are 
the same for every operator in the group, e.g. 


ay 0 0 
o || 0 
0 0 aq 
A set of submatrices forms one of the constituent irreducible 
representations of the group. 
The functions xe—", ye~’, ze—" correspond to a reducible represen- 
tation of the equilateral triangle group 3m (Table 9.1 and Fig. 9.5). 
The matrices representing the different group operators are given in 
Table 9.2. 
As an example consider the effect of applying the operator m, 
representing the mirror plane perpendicular to the x-axis, 


-1 00 
, ye", ze~") = (xe~", ye~", ze") 010 
001 


m,(xe~" 


(9.22) 


Inspection of the Table 9.2 shows that none of the six group elements 
mixes the functions xe~", ye—” with ze—" and the two sets (xe~’, ye") 
and (ze~") span separate spaces which are invariant under the group 3m. 
There is no need to carry out a similarity transformation in this simple 
case. The two separate irreducible representations are listed under A 
and £. 
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a (dt— x) 
a(dt +x) 


a 


a 
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The letters most commonly used to represent a one-dimensional 
representation are A and B, E is used to represent a two-dimensional 
representation and J to represent a three-dimensional representation. 

Another basis for the two-dimensiona] irreducible representation of 
the group 37 is (x t+iy)e—", (« — iy)e-". These functions are a linear 
combination of the previous basis functions and so produce an equivalent 
representations of the group. The transformation matrix representing the 
linear relationship between the two sets of basis functions is 


s-() |) and st =4 ( a 


The matrices of the new equivalent representation are displayed in the 
last row of Table 9.2,e.g. for 3, 


(NM ao) 


Every group has an identity representation, such as A, in which all 
the ‘matrices’ are unity. This is not a ‘faithful’ representation since the 
‘matrices’ obey more relations than the group elements themselves. 

Apart from the two irreducible representations already mentioned 
the group 3m has another one-dimensional representation. 


Table 9.3 Character table for the group 3m. 


The trace of a matrix representing an operator is called the ‘character’ 
of the operator in the representation and is independent of the choice of 
basis functions. 

Table 9.3 is a character table for the group 3m and lists all three 
irreducible representations. 


Classes 


The elements of a group may be partitioned into exclusive sets called 
‘classes’. Given any element A of a group, the set of elements B-! AB, 
where B runs over all elements in the group, is called a class. 

The repeated operation B-"AB is the result obtained by first rotating 
the system to some equivalent position by the rotation B, next carrying 
out the symmetry operation A, and then reversing the initial operation 
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by B-!. The members of a particular class must be operators of the same 
type, such as rotations through the same angle, but performed about 
different axes which are related to each other by members of the group. 

As the characters of all the elements of a class in a given representation 
are identical (9.21), it is only necessary to specify the classes in a group 
character table. 

The six elements of the group 3mm (Table 9.3) are divided in three 
exclusive classes, 


E 
my, M2, M3 
3,, 322. 
The identity operator always forms a class of its own. 

It can be shown that the number of classes in a group is equal to the 
number of irreducible representations of the group. For the equilateral 
triangle group there are three classes and three irreducible representations. 

There is a very useful relationship between the dimensionalities of the 


irreducible representations and the number N of elements in the group. 
If the ith irreducible representation has dimension n; then 


dni =N, (9.23) 


where the sum is over all the possible irreducible representations. 
Consequently if the number of elements and classes in a group is known 
then (9.23) will give the dimensionality of the irreducible representations. 

The group 37m has three classes and hence three irreducible represen- 
tations. As the group has six elements then (9.23) becomes 


12412 42? <6. 


There are two one-dimensional and one two-dimensional irreducible 
representations in agreement with Table 9.3. 


Axial rotation group 


The axial rotation group is the infinite group whose elements are all the 
possible rotations about some axis. Clearly a rotation through an angle 
¢, followed by $4, is exactly equivalent to $2 followed by ¢,. Such a 
group in which all the elements commute with one another is called an 
Abelian group and each element forms a class in itself. As the number 
of irreducible representations is equal to the number of elements, (9.23) 
implies that all these representations are one-dimensional. 

If '(@1) is the number representing the rotation ¢, 


T'(¢1)U'(b2) = M1 + $2). 
This equation has solutions 
Peg) = e?. (9.24) 
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As a rotation through 27 is equivalent to zero rotation, if the function is 
: Single valued, 
POM nm) = PMO) =1 
‘and m is restricted to the values 
m =0, +1, +42,.... 
A suitable choice for basis functions is 


Wm, 8, 6) = f(r, Aei™? (9.25) 


9.4 Symmetry and Quantum Mechanics 


Many molecular systems and solids have symmetry properties which have 
important consequences for the solutions of Schrédinger’s equation and 
are related to the occurrence of degeneracies. For the moment only one- 
electron systems will be considered and the Hamiltonian for a particle 

of mass 7 is, 


h? 
HG 1, X2,X3)=— FV? + Vy, x2, ¥3). (9.26) 


If the old co-ordinates x, x2, x3 are replaced by new co-ordinates 
X4,X,X3 (9.2), by some linear transformation, it is well known that the 
Laplacian is invariant. 

a> a a?# ss? 

ax} * axd *axd oxi? “Ox? * axe oO) 
Consider now a ‘symmetry operation’ that brings the system into self- 
coincidence. The potential function is invariant under the change in 
co-ordinates corresponding to this operation. 

As a simple illustration consider the potential of an electron moving 
in the field of two protons. This arises in the hydrogen molecular ion. 

A rotation through 180° about any axis perpendicular to and bisecting 
the line joining the two protons brings the system into self-coincidence. 
In particular the point P is rotated into P' and clearly the potential 
function has the same value at P and P’, i.e. 


V(x1,X2,X3)= V(x, 2,3) (9.28) 


where the co-ordinates (x;,x2, x3) of Pare related to those of P’ 
(x1, x2, x3 by a linear transformation of the form (9.2). When the 
transformation is interpreted passively as a rotation of co-ordinate axes 
instead of as a physical rotation, (9.28) requires that the potential 
function be invariant under the transformation. 

The complete Hamiltonian is invariant under a linear transformation 
corresponding to a symmetry operation. 
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Figure 9.6 Hydrogen molecular ion. 


Suppose the co-ordinates undergo an arbitrary linear transformation 
corresponding to the operator A. If W(x 1, x2, x3) is any wave-function 


AA(& 1, X2,X3)WX1,X2, x3) = H'(x 1,2, x3) W (x1, X92, 3) 
= H"(x1,%2,X3)AW 1, %2,%3) 
(9.29) 
where #” and w’ are the transformed Hamiltonian and wave-function in 


terms of the new co-ordinates. If A isa symmetry operator #’ is the 


same operator as # and (9.29) implies that the Hamiltonian commutes 
with A. 


The important relationships between commuting operators and 
degeneracy was mentioned in Chapter 5. If y is an eigenfunction of # 
Ay =Ey. (9.30) 


Transform this equation by operating on the left by the symmetry 
operator A. 


A(#y) = HA) = EA). (9.31) 
In general the transformed wave-function 
W'(e4,.%2,43) = AW (1, x2, %3) (9.32) 


is not the same as W(x,, x2, x3) and they are two linearly independent 
wave-functions denoting degenerate states belonging to the same energy. 

A simple illustration is obtained by considering the eigenfunctions 
‘for a particle in box. From equations (4.42) and (4.43) the three eigen- 
functions 


2 a 
W(, 1,2) =sin = xe sin — yr sin — zt+— 
a 2 a 2 a 2 


rf a 2n a w a 
= <n — +— in — +— in— +— 
wi, 2,1) sin” (: 5 ; (> ane (: | 


2a a wT a T a 
= in — en +—|sin — +— 9.33 
y (2,1, 1)=sin (+8) sinE(y 2) in2(: | (9.33) 
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| are degenerate and have the same energy, 
E = 6h?/8ma’. (9.34) 


| In (9.33) the origin is taken to be at the centre of the box. w(1, 1, 2) can 
' be transformed into y(1, 2, 1) or (2, 1, 1) by considering the symmetry 
| rotations of 90° about the x and y axes. 

In general, each energy level possesses a set of degenerate eigen- 

| functions which span a space which is invariant under the action of any 

: group of transformation operators which commute with the Hamiltonian. 
Consequently the eigenfunctions produce some representation of the 
group. 

The important relationship between group theory and quantum 
mechanics is the postulate that the set of all degenerate wave-functions 
belonging to a given energy level form an irreducible representation of 
the group of all the symmetry operators of the Hamiltonian. 

Asa general rule, this statement is correct, but sometimes not all 
degenerate eigenfunctions are generated by applying the symmetry 
operators to a given eigenfunction. These extra degeneracies are called 
accidental. A well-known example that has already been referred to is 
the case of the hydrogen atom (Chapter 4). 

The degeneracy between states with the same and / quantum 
numbers is related to the spherical symmetry of the hydrogen atom and 
is discussed later in this chapter. However, the Coulomb degeneracy 
between the states with the same m quantum number but with different 
values for/, does not arise in this way although Fock (1935) has related 
this degeneracy to the symmetry of the Schrédinger equation in 
momentum space. Often accidental degeneracies can be explained by 
deeper investigation. 


9.5 Representative Operators and Symmetry Operators 


A question that springs to mind is ‘what relation is there between the 
operators representing the observables and the symmetry operators of a 
system?’. In Chapter 5 the reader was introduced to the concept of a 
complete set of commuting observables. When the eigenvalues of each is 
specified, the wave-function is known completely, apart froma phase 
factor. In the present chapter it has been stated that all degeneracy, apart 
from accidental degeneracy, can be explained by the symmetry operators 
that commute with the Hamiltonian. Consequently there must be a 
correspondence between representative operators and symmetry 
operators. 

The simplest example is the relation between translational symmetry 
and linear momentum. The Hamiltonian for a single electron in a fieldless 
space is 


2 
#=-_ 


2m 


222 A FIRST COURSE IN QUANTUM MECHANICS 


and the energy eigenfunctions are 
W(r) =e". 


These are also eigenfunctions of the momentum operator 


By 
i 


with eigenvalues ”k. 
Consider the one-dimensional case. A translational operator P(6x) 
may be defined by 


P(Sx)W (x) = W(x + 8x). (9.35) 

An infinitesimal translation operator J,, can be defined by 
P(6x)-1 
Stim SSO =" (9.36) 
5x0 bx 

so that 

; _ Wx +6x)~ Wx) _ dy 

il. = lim ———— =. 9.37 

x (x) pal Bx oe (9.37) 

From the last equation it is possible to deduce the equivalence 

hid 

—-—=hl 

i dx % 
ie. 

bx =hl, (9.38) 


where p, is the linear momentum operator. This is a particular case of 
the relation 


pEhl, (9.39) 


where p is the momentum operator conjugate to qg. In general a represen- 
tative operator that commutes with the Hamiltonian is related to a 
symmetry operator of the system. 


9.6 The Full Rotation Group 


The rotational symmetry of the hydrogen atom is intimately related to 
the angular momentum eigenfunctions discussed in Chapter 4. 

Let P(@, z) be an operator that rotates the co-ordinate axes by an 
angle @ about the z-axis in the positive sense. An infinitesimal rotation 
operator may be defined by 


P(5¢,z) —1 
uf? = lim EAC ball : 


9.40 
8670 5¢ 0-49) 
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| Since, in spherical polar co-ordinates 


P(So, z)W(r, 8, 6) = U(r, 8, 6 + 8) 


then 
aw 
Ty =—. 4 
Ne, (9.41) 
The z-component of angular momentum can be written 
_hoa 
aie a¢ 
and so 
L, =n”. (9.42) 


This final equation is a special case of (9.39) and shows how axial 
rotational symmetry is related to angular momentum about the axis. 
Consider a diatomic molecule and choose the z-axis to lie along the 
line joining the nuclei. The rotational symmetry of the molecule implies 
that the rotational operator J” and hence the angular momentum 
operator Y, commute with the Hamiltonian. The z-component of angular 
momentum is a constant of the motion and the eigenfunctions take the 
form (9.25) with eigenvalues mh. The states with |m|=0,1,2,... are 
denoted by 2, TI, A,.... 
Similarly to (9.42) 


L.=hI and GF, =H”. (9.43) 


Clearly, the infinitesimal rotation operators must obey the commu- 
tation relations similar to those for angular momentum operators, e.g. 


LO? LITO Se, (9.44) 


This result can be obtained without reference to the angular momentum 
operators. 

Infinitesimal rotation operators (but not finite rotations) add like 
vectors and if 7Y”? is an infinitesimal rotation about the a-axis 


Mey +m + nif) (9.45) 


where J, m, n are the direction cosines of the a-axis. This is related to the 
vector properties of angular momentum. 

From (9.40) it can be seen that a finite rotation through an angle 6¢ 
about the z-axis can be expressed by 


P(8¢,z)=1 +1561 + €5¢ 


where e > 0 as 86 >0. 


| 
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The effect of the infinitesimal rotation operators on the co-ordinates 
of a point can be represented in matrix form. Consider a rotation of axes 
through 6@ about the z-axis in the anti-clockwise sense. From (9.3) and | 
(9.1) | 


/ 


x x cos6¢ sin6¢d O\ f/x 
y }=P(6¢,z)| y J ={—sind¢@ cosé6d Oy 
z Zz 0 0 1 4 | 
The operator P(&¢, z) is represented by the matrix 
cosé¢@ sindd O 1 0 0 0 1 0 
—snS¢ cosddé O}=[0 1 O]+[-1 O 0} 5¢+e6¢ 
0 0 1 00 1 0 0 0 


where € > 0 as 6¢ > O and so the infinitesimal rotation operator / Mis 
represented by the matrix 


0 -1 0 
if 1 0 0 
0 00 
Similarly J. iv ) and I ie ) are represented by 
0 0 O 00 1 
if 0 0 -1 and i 000 
0 1 0 -1 00 


respectively. These three matrices do satisfy the commutation relations 
(9.44). 

Any rotation about an axis can be expressed in terms of the corre- 
sponding infinitesimal rotation operator. Consider a rotation through 
the angle ¢' about the z-axis. If the wave function is analytic then by 
Taylor’s theorem 


PO, 207, 8, 0) = V(r, 0,046) 


2. 6” a" W(r, 0, 
=v7,0,0)+ > eee 


n=1 
Since 


ri) 

—= 7") 
ao ue 
then 


P¢’, vr.) + > =o V(r, 0, ¢). 
n=1 
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This can formally be written 


P(¢', z)= exp (i¢'1f”) (9.46a) 
or in the general case for a rotation through @ about the a-axis 
P(6, a) = exp (i019). (9.46b) 


This is an important result and together with (9.45) shows that any 
rotation can be expressed in terms of the three infinitesimal rotation 
operators J), 1) and 16”. 

The full rotation group is the infinite set of all proper rotations about 
all axes through a fixed point. If a function space is invariant under J{”), 
wy ),1£” then it is also invariant under the full rotation group. The 
converse is also true. 


Angular momentum and irreducible representation 


In the next chapter spin momentum operators are defined through 
appropriate infinitesimal rotation operators and it follows that the spin 
operators satisfy the same commutation rules as those for angular 
momentum. It is worthwhile at this point to reconsider the derivation of 
the allowed eigenvalues for angular momentum by an argument based 
solely on the commutation relations. Essentially this is the problem of 
finding the irreducible representations for the rotation group. (See Heine, 
p. 55, for a direct method of finding the representations.) 

Since the operators #? and Y, commute, there exists a set of 
functions which are simultaneously eigenfunctions of both these 
operators and any other operators that are required to form a complete 
set. Let VW be the unique function that satisfies the eigenvalue 
equations 


LV m = 07K Wm 
LV = MAW m (9.47) 


and is also an eigenfunction with given eigenvalues of the remaining 
operators in the complete set. If the system possesses spherical sym- 
metry these operators (apart from Y,) will also commute with 4%, and 
#, . This will be assumed and it is shown below how it is possible to 
generate from y,,, all the simultaneous eigenfunctions of the complete 
set that differ only in their eigenvalue of Y&,. Each of these functions is 
the basis for a one-dimensional irreducible representation of J}”’, e.g. 


LOY =m. (9.48) 
It is convenient to introduce two ‘shift’ (or ‘ladder’) operators F. 
defined by 


L,= L£,+iF, L_= £,-if,. (9.49) 
These operators satisfy the commutation rules 
L,L: —- L: L$, 2=hZ,. (9.50) 
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Consider the function &,,,. Simple algebra gives 
Ll L.Vm) =(L. LZ, th Ls Wm = (m + 1A L+Vm) 
and 
L?(LiVm) = LL Vm =a*hn?( LW). 
All other operators in the complete set commute with Y, and so this 
shift operator generates from W,, a sequence of simultaneous eigen- 


functions that differ only in their z-component of angular momentum. 
The eigenvalues are 


mh, (m + 1)h, (m + 2)h,.... (9.51) 


Similarly, A_ generates from y,,, a sequence of simultaneous eigen- 
function with decreasing z-component of angular momentum 


(m —1)h, (m —2)h,.... (9.52) 


The number of eigenfunctions that can be generated in this way is 
finite. Any physical system will have a finite value for total angular 
momentum. That is, a? is finite. Also, since 


(L2+ L3.m =(L? — L2)bm 
= (a? — m?)h? Wm (9.53) 
then 
a? —~m?>0 


as £2 + ZF is a positive definite operator and must have positive 
eigenvalues. The eigenvalues of Y, are bounded both above and below 
and let 1,# and I_h be the largest and smallest eigenvalues with eigen- 
functions W, and W7_ respectively. 

Since 1h is the largest eigenvalue 


#101, =0 (9.54) 
and similarly 

L_ Wy, =0. ; (9.55) 
From the commutation result (it is easy to verify it) 

GL: L% =L? —- $(L, Fh) (9.56) 


then 
G-LM,205 fa? -e + AW, 
and 
Lr. G-Wy_=07 [a —LL - 1a 
a =1,0,+1)=L@ —- 1) 
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lie. 
(l. +1), —1_ +1) =0. 


Clearly (l, — 1. +1)>0as/, and /_ are the largest and smailest 
quantum numbers and so 


l, =-L. =1 (say) 


‘and 

a” =1(1 +1). (9.57) 
Since each step in the shift operation is by a unit of A then 

4,-1_=2] (9.58) 


is either zero or a positive integer. 

There are (2/ + 1) functions which are simultaneously eigenfunctions 
of the complete set of operators including #2 and Y, which have total 
angular momentum eigenvalue Vi(/ + 1)h. The functions have eigenvalues 


lh,@—Wh,...,-ih (9.59) 


for &,. 

The above analysis shows that this set of (2/ + 1) functions span a 
space that is irreducible under the set of operators #,, #, and L_ 
and hence under ¥,, Y,, L,. From (9.42) and (9.43) this space is 
irreducible under the infinitesimal rotation operators and hence under 
the full rotation group. The irreducible representations may be denoted 
by D© where the dimensionality (2/'+ 1) is a positive integer. 

The argument has been based solely on the commutation relations 
satisfied by the angular momentum operators and is directly applicable 
to the spin and total angular momentum operators to be met in the 
succeeding chapters. 


The one-electron central field problem 


The central field Hamiltonian (5.23) is invariant under the full rotation 
group and so commutes with the angular momentum operators. Its 
eigenfunction must belong to one or other of the irreducible 
representations. D, These eigenfunctions are from (5.41) 


nim (7, 8, 0) = Rni(7)PI"#(cos 6) 6°" 
and are clearly reduced with respect to rotations about the z-axis 
LOW nm, = mW nim). 


By using H, and _ it is possible to generate the degenerate set of 
eigenfunctions 
Vnim, m=+1,+(—1),...,.-1 (9.60) 


This set forms the basis for the irreducible representation with 
dimension (2/ + 1). 
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The s-functions (/ = 0) transform according to D©), the p-functions | 
transform like D“ and the d-functions like D@). . 

The degeneracy of the coulomb potential wave-functions with the 
same principle quantum number but different 7 quantum numbers is not 
explained by the rotational symmetry of the atom in three-dimensional | 
space. 

In the above, / is required to be an integer as the wave-functions must, 
be single-valued and so the irreducible representations have odd dimen- 
sion. However, representations of even dimension do exist. In particular, 
the representation D(/2) of dimension two will be used in the discussion 
of electron spin in the next chapter. 


9.7 Electron in a Periodic Field 


The lattice periodicity of crystals allows the energy eigenstates to be 
classified by a real wave vector k and the eigenfunctions are called 
Bloch functions. The essentials of the problem are contained in the 
one-dimensional case considered below. 
The Schrédinger equation for a line of identical atoms, distance ‘a’ 
apart, is 
h? dy 


~ Sm ga? * VO) =v (9.61) 


with 
V(x -a)= V(x). 


The substitutional operator representing the translation na is written 
{e] na} with 


{e|na} V(x) = V(x — na). (9.62) 


If the line of atoms is infinitely long the number of lattice translation: 
is infinite. To keep the number of distinct translations finite it is usual 
to introduce periodic boundary conditions. The finite line is divided up 
into identical ‘micro-crystals’ each containing N atoms so that a trans- 
lation through Na produce no change at all, i.e. 


{e|0} = {e|Na}. (9.63) 


The translation operators form a group of N elements and as all 
translations commute with one another, the group is an Abelian group. 
As already mentioned previously in the discussion of the axial rotation 
group, each element in an Abelian group forms a class in itself and, the 
number of irreducible representations is equal to the number of elements 
N. All these representations are one-dimensional. 

The operator V? is invariant under a translation of the co-ordinate 
system and so the Hamiltonian is also invariant. Consequently the energy 
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eigenfunctions must transform according to one of the irreducible 
representations of the group. An alternative view point is to note that the 
Hamiltonian and translation operators commute with each other so wave- 
functions may be chosen to be simultaneously eigenfunctions of # and 
all the operators {e|na}. 
If p is the basis for one of the irreducible representations 
fela}v@)=Av(x) and — {ela} W(x) =A" V(x) 
where A is a complex constant. The periodic boundary conditions require 
WW =1 
' and so A must be one of the Vth roots of unity 
MeN FET 2 a. 
If a real wave number k is defined by 


then the irreducible representations and hence the energy eigenfunctions 
may be classified by the wave number and 


{e| ma} W(x) =e W(x). (9.64) 
This is Bloch’s theorem and a wave-function satisfying this condition is 
We(x) = eu; (x) (9.65) 


where u;,(x) has the lattice periodicity. 
9.8 Group Theory and Perturbation 


As has been seen in Chapter 7, perturbation of a physical system often 
results in the removal of degeneracies. This is of course to be expected 
as the perturbed Hamiltonian has in general a lower symmetry than the 
unperturbed Hamiltonian. The previously degenerate wave-functions 
now form a basis-for a reducible representation for the new smaller 
symmetry group and will split into two or more irreducible represen- 
tations. A degenerate level will divide into two or more levels of 
diminished degeneracy. 

In the discussion of the first-order Zeeman effect in Chapter 7 it was 
shown how the (2/ + 1) degenerate eigenfunctions of the hydrogen atom 
with given n and / quantum numbers split into (2/ + 1) separate levels 
when an axially symmetric magnetic field is applied. 


The crystal field as a perturbation 


Important examples of the removal of degeneracies by perturbation occur 
in the splitting of the energy levels of paramagnetic ions of transition and 
rare earth metals in a crystal. The metallic ion and its surrounding neigh- 
bours (water molecules etc), should be solved as a many-body problem. 
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However, it is often a good approximation to consider the neighbours as 
producing an electrostatic crystal field which perturbs the degenerate 
(d or f) eigenfunctions of the free metallic ion. Paramagnetic resonance 
is a tool that has proved useful in experimental measurements. 

The Hamiltonian of an ion in the crystal is written 


HEH +h, 


where #° is the spherically symmetrical free ion Hamiltonian and he 

is the electrostatic potential energy of the ion in the crystal field. In the 
intermediate crystal field case, 4, is greater than the spin-orbit coupling 
terms (spin is introduced in the next chapter) and the main effect is to 
split the central field orbital degeneracy. The full analysis involves the 
many-electron wave-function and will not be attempted here. A simplified 
discussion is given below. 

Ions of the iron transition group with partially-filled 3d-shells often 
belong to the intermediate field case. Often they occur in surroundings 
that are almost cubic, i.e. 4, has cubic symmetry. 

The five degenerate d-wave-functions 


2ety)2, &tnYfO, V4IGB2-r)FO 


form a basis for the irreducible representation D) of the full rotation 
group. This 5-fold degenerate level splits up into a 2-fold and a 3-fold 
level under the perturbing action of the cubic crystal field. To see this it 
is only necessary to consider the 24 proper rotations of the group 432 
that leaves a cube invariant. (The full cubic group m3m contains 48 _ 
operations and can be expressed as the direct product m3m = 432 x I.) 
The 24 elements are divided into 5 classes as below 


(a) The identity. 

(b) The 4 3-fold diagonal axes making 8 elements denoted by 3. 

(c) The 3 rotations through 180° aboyt the cube edges denoted by 2,. 

(d) The 6 rotations through 180° about the diagonals in the sides of the 
cube denoted by 2,. 

(e) The 6 rotations through +90° about the cubic edge denoted by 4,. 


Table 9.4 The character table for this group is given below. 
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The 5d-wave functions transform according to a reducible representation 
of this group and the characters are given in the last row of Table 9.4. It 
is immediately clear from the table that this representation is composed 
of the two irreducible representations F and T> and so the 5-fold 
degenerate level splits up into a 2-fold level and a 3-fold level. 

In many cases the almost cubic crystal field may include smaller 

order terms with a further reduced symmetry and the degeneracy may be 

further reduced. The reader is referred ta Heine, p. 148, and Schutte, 


p. 348, for further details. 
ener 
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Figure 9.7 Splitting of 5-fold D‘?)-degeneracy by crystalline fields. 


9.9 Selection Rules 


Group theory is very useful in determining which transitions are allowed 
in the interaction of electromagnetic radiation and electrons in atoms 
and molecules. In Chapter 8 it was shown that the transition probabilities 


for a single electron are proportional to 


2 
| { XFEX: dz| 
where x; and xy are the initial and final state functions. 
It can be shown that if the function @ is one of the basis functions for 
an irreducible representation of a symmetry group then the integral over 
all co-ordinates 


Joar=o 
if the representation is not the unit representation. The three components 
of r transform like the three p wave-functions, i.e., according to the 
irreducible representation D@) of the full rotation group. This fact 
together with a knowledge of the symmetry properties of x; and xr 


(8.38) 
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make it possible to decide when the integral (8.38) is zero and hence 
which transitions are forbidden. This topic is dealt with in detail in 
Chapter 11 and will not be pursued here. 


PROBLEMS 


1 Prove that the elements of the transformation matrix a (see equation, 
(9.3)) satisfy the relations | 


3 
Dof#=1 — foralli 
j=1 


> ij; Aim =0 j -~m. 
Prove also that the determinant of the transformation matrix is +1. 


2 Sketch the two functions 


Py=fQ@)sin@ py=f(r)cosé. 


If 4, is the substitutional operator representing a rotation of 7/2 radians 
in the anti-clockwise direction about the z-axis, show that 


4,Py = Dx. 


3 Describe the eight symmetry operations which leave a square 
invariant. The two faces of the square are not identical. Construct a 
group multiplication table. (This is the group 4mm.) 


4 Repeat question three for a rectangle. (In this case the symmetry 
group is composed of four elements and the group symbol is 27m.) 


5 Show that the four elements 1, i, —1, —i where i= +./—1 forma 
group. Draw up a group multiplication table. 


6 Consider a system which has a single 4-fold rotation axis in the z- 
direction. Identify the four elements and draw up a group multiplication 
table. This is the group 4. Show that this table can be made to have the 
same form as that of question 5 by pairing off each element of the 
group 4 with a suitable element of the group of question 5. The two 
groups are said to be isomorphic. 


7 Areal operator A is related to its Hermitian adjoint A* by 


[ory dr= [tary dr 


where ¢, y are arbitrary functions and the integration is over all space. 
If A isa linear operator which possesses an inverse A~! it is said to be a 
unitary operator if A+ = A7}. 
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The integral over all space f @*y dr is invariant under any reflection 
or rotation of axes. Explain how this implies that a symmetry operator 
is unitary. 


8 Table 9.3 gives the group character table for the equilateral triangle 
group 3m. 

The character of the group element A is the a-irreducible representation 
is denoted by x (A). Verify the ‘orthogonality relations’, 


ZX"A)xP)=0 a #8 
A 


ZX") =N. 


The summation is over all the V elements in the group. 


9 Suppose x(A) is the character of the group operator A in the 
reducible representation I’ and x (A) is the character of A in the 
irreducible representation ', Then 


x(A) = 2aix(A) for all A 


where a; is the number of times !' occurs in I. 
By multiplying both sides by x/)*(A) and summing over all members 
of the group show that 


aj yo x*(A)x(A) (1) 


where N is the number of elements in the group. (Hint: Use the 
orthogonality relations given in question 8.) 

Consider the reducible representations of the group 3m given by the 
first row of Table 9.2. Use equation (1) to show that this representation 
is composed of irreducible representations A, £. 


10 Write down the 24 symmetry operations which leave a regular 
tetrahedron invariant. This is the group 43m. Divide the group elements 
into the five classes and give the dimensions of all the irreducible 
representations. 


11 Ina free atom the atomic orbitals (25), (2px), (2py) and (2p,) are 
degenerate. In a tetrahedral molecule, such as methane the molecular 
field perturbs the free-atom carbon wave-functions and destroys this 
degeneracy. Choose one element from each of the five classes of the 
tetrahedral group 43m and construct the matrices of the reducible 
representations with basis (2s), (2p), (2py) and (2p,). Show that this 
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representation is composed of a single one-dimensional and one three- 
dimensional irreducible representations. (Hint: Choose the co-ordinate 
axes so that a three-fold axis lies along the (111) direction.) 


12 Suppose a uniform electric field is applied along one of the three- 
fold axes (C—H bond) of the methane molecule. The reduced symmetry 
group at the carbon atom is the group 3m composed of six elements and 
the group character table is given below. 


Identify the elements A and B. 

Show that the three-dimensional irreducible representation of the 
group 43m with basis (2p,.), (2py) and (2p,) is composed of two 
irreducible representations of the reduced symmetry group 3m. 

This result shows that the three-fold degenerate level splits up into a 
non-degenerate and doubly-degenerate level when the electric field is 
applied. 


13 Let a be an axis in the yz plane through the origin at an angle of 0 
to the z-axis. A physical rotation through ¢ about the a-axis is equivalent 
to a physical rotation 6 about the x-axis so that the a-axis coincides with 
the z-axis, followed by a physical rotation through ¢ about the z-axis 
followed finally by a physical rotation through —@ about the x-axis. 


P(—9, a) = P(@,x)P(—¢, z)P(—6, x) 


where P(@, x) denotes a passive rotation of the co-ordinate system 
throu x6 about the x-axis. Use (9.45) to express J”) in terms of my 
and J” and with the help of (9.46a) show that 


LOT = TLOT = The 


by equating coefficients of the product 6¢. 
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CHAPTER 10 


Electron Spin 


10.1. Introduction 


The quantum mechanical theory developed in the previous chapters is 
not sufficient to explain all the experimental results of atomic physics. 
It has already been shown that the magnitude of the angular momentum 
of a particle in a central field problem can take the values 4 V1 (+1) 
where / is an integer. This follows from the differential form for the 
angular momentum operator together with the requirement that the 
wave-function be single-valued. However, half integral values of the 
angular momentum quantum numbers are necessary to explain the fine 
structure in the spectral series of some elements. The sodium D-line 
doublet is a well-known example. 

Also the anomolous Zeeman effect cannot be explained by integral 
values of / alone. Schrédinger theory indicates that the lowest state for 
the sodium atom is non-degenerate with zero angular momentum but 
the Zeeman effect in sodium can only be explained by assuming the 
ground state splits into two levels when a weak magnetic field is applied. 

Apart from this spectroscopic evidence, Stern and Gerlach in 1921 
demonstrated experimentally the existence of the magnetic moment of 
an electron. They did this by passing a narrow beam of silver atoms 
through an inhomogeneous magnetic field and observing that this beam 
split into two beams. (Note that a beam of electrons is not used since 
the Stern-Gerlach separation is then of the order of that imposed by the 
uncertainty principle.) 


10.2. The Electron Spin 


Uhlenbeck and Goudsmit in 1925 suggested that the electron has an 
intrinsic angular momentum and in 1927 Pauli introduced electron spin 
into quantum mechanics. 
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The experimental results can be understood if it is assumed that an 
electron has an intrinsic spin angular momentum of magnitude 
h~v/s(s + 1) where s = 4. Further, the component of the angular 
momentum can only be known along any one prescribed direction, 
say the z-direction and must take one of the values m,h with 


m, = +4. (10.1) 


Also, it must be assumed that associated with the spin angular 
momentum the electron has a magnetic moment equal to —eft/m times 
the spin momentum where m is the electron mass. 

These assumptions arise naturally in the Dirac relativistic theory but 
must be postulated in the non-relativistic theory now under discussion. 

In the Schrddinger theory, the three position co-ordinates of an 
electron form a complete set and when all three are known the electron 
state is specified completely. Now for a complete specification of a state 
the projection of the spin momentum along a prescribed direction must 
be known also. 


Postulate 7. Spin 


It is now postulated that a particle, in addition to its spatial co-ordinates, 
has an internal degree of freedom characterized by a spin co-ordinate o,. 
The particle state function is then a function of four co-ordinates 

v(x, y, Z, Gz). The spin co-ordinate can only take a finite number of 
discrete values. For electrons, positrons, neutrons and protons, 0, can 
take either of the two values +1. (This corresponds to the z-component 
of the spin momentum taking the values 4/2.) The rest of this chapter 
is concerned only with such particles with spin 1/2. 

The quantum mechanics based upon the seven postulates in this book 
is valid so long as the energies included are small compared with the 
energy associated with the particle rest mass (~0-5 MeV for electrons). 
At high energies a relativistic theory should be applied. 

The operators representing the components of orbital angular 
momentum have been obtained from the classical expressions using 
Schrédinger’s substitution for momentum. Because the spin co-ordinate 
can take only two values it follows (see below) that an electron spin 
momentum components can only take the values t#/2 and hence in the 
classical limit 4 > 0, spin vanishes. Spin does not occur in classical 
mechanics, and an alternative method must be used to define spin 
momentum. 

In Chapter 9 it has been shown how p, the momentum operator conjugate 
to q, is related to the infinitesimal translation (rotation) operator Ig by 

p=hlq. (10.2) 
The infinitesimal rotation operators discussed in section 9.7 transform 


the three space co-ordinates and satisfy the three commutation rules of 
the form (9.44). These rules are a basic property of rotation operators. 
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The spin co-ordinate will be transformed under a rotation of axes and the 
infinitesimal rotation operators J{°, i and J§°) which carry out this 
transformation must satisfy the basic commutation rules of the type 
(9.44). 

The fundamental result (10.2) can now be extended to define three 
spin momentum operators 


sy, =A, sy =n, — 5, =hlS), (10.3) 


The spin infinitesimal rotation operators (unlike the space co-ordinate 
infinitesimal rotation operators) do not have a differential operator form 
and so neither do the spin momentum operators. However, the spin 
momentum operators must satisfy the same commutation relations as 
Ly, Ly and L,. That is 

SxS) — SySx =ihs, 

SySz — 828y =ihs,. 

S28x — 8x8, = ihsy. (10.4) 
As the spin component operators do not commute with each other only 


one of them, say sz, can be known with certainty at any instant. The 
total spin momentum operator defined by 


Sap Sse bas 5? (10.5) 


commutes with all three components and so the total spin momentum 
can be measured simultaneously with any one of the three components. 


10.3. Spin Wave-Functions and Representations 


The wave-function for a single electron is now to be regarded as a 
function of four variables. The three space variables are continuous but 
the spin variable can only take discrete values +1. So 


W(X, Ys Z, 2) = W(x, y, z)a(oz) + P_(, Y, z)B(oz) (10.6) 
where a(o,) and 8(c,) are functions such that 

a(1)=1 and a(—1)=0 

B(1) =0 and 6(-1)=1. (10.7) 


W(x, y, z) is the ‘component’ of the wave-function with spin co-ordinate 
o, =1 and w_(x, y, z) is the ‘component’ with spin co-ordinate 0, = —1. 
If the electron is in a state defined by o, = 1 say, then 


W(x, ¥, Z, 02) = Wi, y, Z) 
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In the general case f |y4|? dr is the probability of observing the state 
0g, = 1. The values of a, correspond to the eigenvalues of sz. This is 
explained below. 

A rotation of axes corresponds to a linear transformation of a(o,) and 
B(a,). From (10.6) all functions of a, lie in the two-dimensional spin- 
space spanned by a(o,) and B(a,) and so the space is invariant under 
the rotation group. For particles with spin } this space must be 
irreducible under the group and a(o;), 8(¢,) form a basis for the 
representation D(/2), Conventionally o(o,) and B(o,) are taken to be 
the eigenfunctions of the rotation operator J, fr ) with eigenvalues 
ms = 4, —} respectively (9.48), i.e. 


1a(oz)=4a(0,) with ms =4 


or 
h 
5z0(0,) = 2 a(o3) 
and 
I$ B(o,)=—38(o,) with m,=—4 
or 


52B(02) = —5 B(0,)- (10.8) 


The spin operator sdp = 53 +s}, + 5? has eigenvalues $(3 + 1)h? = 3? in 
both states (see (9.59)). The results agree with the requirements 
mentioned in section 10.2. 

If P°(@, z) is the operator that represents a rotation of @ about the 
z-axis and acts only on the spin co-ordinates, then from (9.46) 


P*(@, z)a(oz) = exp(id/2)a(oz) (10.9) 
and similarly for 6(¢,) 
P*(2n, z)o(9,) = —o(9,) (10.10) 


and 


PS (4n, z)a(oz) = +a(o,). 


Rotations of 47 must be considered before a(a,) and 6(oz) are trans- 
formed identically into themselves. These functions are called spinors. 
It has been stated in postulate 1 that all wave-functions must be 
single-valued. This is not strictly true. The physical requirement is that 

integrals of the form 


[utay;ar 
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be single-valued. If y; and y; both change sign under a rotation of 27 
applied to all co-ordinates then clearly the integral is invariant. However 
arguments have been given (Blatt and Weisskopf 1952, H. E. Rorschach 
1962), to show that wave-functions depending on the three space co- 
ordinates x, y, z must be single valued as already stated. This argument 
does not apply to wave-functions depending on spin. 


Matrix representation 


The spin momentum operators do not possess a differential operator 
representation. However, a set of representative matrices can be con- 
structed. If the spin eigenfunctions are regarded as basis functions then 
an arbitrary wave-function W(x, y, Z, dz) is represented by the column 
vector (see (10.6)). 


Wa, y 4 ) | 
W_@y,z)]. 
Normalization of the wave function w(x, y, z, o,) requires 


> five», zo)? dr= 5 J flvsl2a2(oz) + 1v_l67(02) 
+(WFW_ + ¥+0")a(oz)6(o2)] dr=1 


where the summation is over the two discrete values o, = +1. 
From (10.7) 


2,4(02)B(02) =0 


(10.11) 


and 
2.a*(;) = 26"(o2) =a 
oe ° 
f [lve » ZI? + W_@ », z)/?] dr=l. (10.12) 


The probability of the z-component of electron spin being measured 
as th/2 is 


five y z)I? dr 
and similarly, the probability of spin —h/2 is 
flv », z)Par. 


The column vectors representing a(o,) and B(a,) are respectively 


= : d = " 10.13 
a=|'| an e-[]. (10.13) 
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The matrices representing the spin momentum operators Sy, Sy and s, 
must be 2 x 2 and the reader can easily verify that the choice 


22 (0 1 _alo -i _afi 0 
Oli of. ole of ~* alo 4 


(10.14) 
satisfies the commutation relations (10.4). These can be written 
h h 
%==0,, sy=he, s,=—2, (10.15) 
2 2 2 


0 1 0 -i 1 0 
= = = wl 
we). o2(0 a. a(t 9) caso 


and o,, 6, and o, are known as the Pauli spin matrices. All these matrices 
are Hermitian and have real eigenvalues. The correctness of the choice 
(10.14) is confirmed by noting that the eigenvalue equations 


h h 
a= 7a and s2B= 38 


are represented by 


(10.17) 
respectively. 
Note also that from (10.13) and (10.14) 
h h ih ih 
sx=> 8B, 8B > &, sya= 8, syB= “7% 
(10.18) 


10.4 Total Angular Momentum 


It is necessary to know how the one-electron operators which are 
functions of the space co-ordinates and momenta (and not of spin) are 
represented in the new two component theory. Clearly such operators 
must be represented by 2 x 2 matrices. The (1, 1) element of the matrix 
representing a Schrédinger operator yy which is spin independent, is 


2 a(o,)ya(o,) = 7 2.a°(a2) =¥. 
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Similarly the (2, 2) element is 

> B(oz) B(0z) = ¥ 2B’ (oz) = 7. 

Oz Oz 
However the off-diagonal elements are both zero as 

2 a(0,)7B(0z) =7 2 a(0,)B(o,) = 0. 

Oz oz 


So, all spin independent operators, such as 7, are represented by a 
diagonal matrix 


0 
0 ¥ 


All such diagonal matrices commute with the Pauli spin matrice. 


Total angular momentum 


The total angular momentum of an electron is defined to be the sum of 
the orbital angular momentum and the spin angular momentum. The 
2 x 2 matrix operator representing the z-component is 


= Li1+s,2=|'% se > (10.19) 
0 Fag 2 


J, has half-integer quantum numbers as #, and s, have integer and 
half-integer quantum numbers respectively. It is instructive to demon- 
strate this using the two component theory. The eigenvalues of J, are 
denoted by mj and in two component form, the eigenvalue equation 
is (in spherical polars) 


22 (9) o2() 2) (Ys)=mn(¥e) 


This represents two equations, 


1]o i 

Pa E tly. = m+ 
i) i 

i E ~ | Vas te 
The first equation may be written 


ows 
a yy 


v+(r, 8, 6) =A4(r, 0) exp [i (mm; — $)¢]. 
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As W, must be invariant under a rotation of 27 about the z-axis 
m; — d= integer 
m,; = integer + 3. 
Similarly 
v_(r, 8, $)=A_(, 8) exp [i(m; +4) 4] 
and 
m, = integer 4, 


Therefore the eigenvalues of J, are half-integral multiples of h. 
The matrix operators representing the x and y components of the 
total angular momentum are 


I.= Zits, JIy=LZ,Its. 
The matrix representing the squared total angular momentum is 
Pet + 
=H 1+s? +2[ Gs, + Ls, + Fs;] (10.20) 


as the matrices representing the orbital and spin components commute. 
The matrices J,,, J, and J, satisfy the same commutation relations 

as the orbital and spin angular momentum operators and a straight 

forward manipulation will show that each component commutes with J?. 

‘So, functions can be found which are simultaneously eigenfunctions of 

J, and J”. 


Rotation operators 


The total angular momentum can be associated with infinitesimal 
operators representing a simultaneous rotation of both space and spin 
co-ordinates. From the definition, the z-component of the total angular 
momentum is equivalent to the sum of infinitesimal rotation operators 


AUS? +18) 


and similarly for the x and y components. 

Consider now an operator which produces a finite rotation of both 
space and spin co-ordinates through an angle 6 about the z-axis. This 
operator can be expressed as a product of two operators one of which 
acts on the space co-ordinates and the other on the spin co-ordinate. If 
i) is the operator representing an infinitesimal rotation of all four 
co-ordinates, then from (9.46b) 


exp ((61{) = exp (01) exp (OIL). 
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This is valid for all 9 and so by equating the coefficients of 6 
If) = 1) +18), 
In conclusion, the z-component of the total angular momentum is 


equivalent to the ‘total’ infinitesimal rotation operator, J‘) multiplied 
by ht, ie. 


nis? 
and similarly for the x and y components. These three new ‘infinitesimal’ 


rotation operators /{"), if) and I) clearly obey the commutation 
relations of the type (10.4). 


Eigenvalues 


The argument given in section 9.7, which depends only on the commu- 
tation relations, now indicates that J? has a set of (27 + 1) eigenfunctions, 
with 27 integer, with common eigenvalue j(j + 1)h?. Each of these 
eigenfunctions may be chosen to be simultaneously an eigenfunction of 
J, with separate eigenvalues 


mjh = jh, (j —\)h,...,—Jh. (10.21) 


For a single electron, mj and hence / is half-integral. 


Addition of angular momenta 


Consider a one-electron system whose orbital angular momentum and 
spin momentum are conserved. It is shown below that for a given / the j 
quantum number can take only two values, namely / + $. 

The four matrix operators J?, J,, FI and s? all commute with one 
another and the corresponding variables are simultaneously measurable. 
Alternatively, the four matrices #71, FI, s?,s, also form a mutually 
commuting set and the observables are compatible. 

Suppose these two sets of four operators are taken to be complete. 
Any other operators needed to make the sets complete will be assumed 
to have fixed eigenvalues. Normally the Hamiltonian is one of these and 
it must be spherically symmetrical and not include any spin-orbit 
coupling term. For a single electron s? can only have the eigenvalue 
3h2/4 and any state can be specified by the three quantum numbers 


@) jmp! or by (ii) 4 m, ms 


respectively. 

In order to find the different possible eigenvalues of J? associated 
with the first set of operators it is useful to consider the labelling of the 
different states arising from the second set. 
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Consider the second set. For a given J, m; can take (2/ + 1) values and 
ms can take two values and so there are 2(2/ + 1) different degenerate 
states altogether. Linear combinations of these states can be constructed 
that are eigenfunctions of the first set of operators. Consequently, for a 
given / there must be 2(2/ + 1) different states. 

From (10.19) mj =m, + m, and the third column of table 10.1 shows 
how the allowed values for m; in the second set are obtained while the 
fourth and fifth columns show how they arise in the first set. 


Table 10.1 


Second set First set 


I + 
wi 
| +7 
+4 
+ 
ee) 
-4 


It can be seen that all 2(2/ + 1) states for a given /, that occur in the 
first set can be accounted for by letting j take the values 


jal 1>0. (10.22) 


In the language of the ‘vector model’, the electron spin angular momentum 
is either parallel or antiparallel to the orbital angular momentum. (Note, 
when / = 0,7 must be equal to 4). 


10.5 Spin Magnetic Momentum 


It is well known that a classical electron moving in a circular orbit with 
angular momentum L has a magnetic moment —eL/2m where m is the 
electron mass. Perhaps not surprisingly there is also a spin magnetic 
moment associated with the electron spin momentum. Using relativistic 
arguments (F. Rohrlich, 1965), it can be shown that the relationship 
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between the spin angular momentum of a point electron and its spin 
magnetic moment pis 


=—ges/2m 


where g = 2 to a first approximation. 

(This value of g is slightly in error due to the neglect of radiation 
reactions.) Observe that the ratio of spin magnetic moment to spin 
momentum is twice as large as the corresponding orbital ratio. 


Spin spectroscopy 


Suppose an electron is in a uniform static field of induction B. This field 
interacts with the electron moment to give a contribution 


—p.B (10.23) 


to the energy. If the spin-independent terms are omitted the Hamiltonian 
can be written 


e 
— [BuSx + Bysy +Bz82] (10.24) 


where B,, By and B, are the components of the field. 
If the field is parallel to the z-axis, the time-independent Schrodinger 
equation becomes 


eh, (1 0\(v+\_ [vs 
(4 ‘Ni )-2(7); (10.25) 


This two component equation has two eigensolutions. The upper 
level has energy 


with eigenvector B. So the effect of an applied static magnetic field B is 
to split an electron energy level up into two levels with energies 
+ehB,/2m corresponding to the two spin directions. This is essentially 
the explanation of the anomalous Zeeman effect in atoms with one 
valence electron. 

One method of observing these levels is by electron spin resonance. 
The system in the static field B is subjected to an alternating electro- 
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magnetic field of angular frequency w. When the frequency satisfies 
hw = ehB,/m 
ie. 
w=eB,/m 


transitions are induced and resonance occurs with energy absorption 
from the alternating field. Electron resonance experiments are generally 
carried out at fixed frequencies and the field B, varied. Such experiments 
can give useful information about the magnetic moments of more com- 
plicated systems and hence about their structures. 

Atomic nuclei also have spin and associated magnetic moments. The 
splitting of the nuclear levels in a magnetic field can also be observed by 
resonance techniques. The resonance frequency for electrons is in the 
micro-wave region whereas the heavier mass of the nuclei result in a 
smaller (hyperfine) energy splitting and the corresponding resonance 
frequencies are in the radio frequencies (see problem 7). 


Fine structure 


The energy expression (4.71) does not account completely for the 
observed spectrum of hydrogen. Many of the spectrum lines are com- 
posed of several components illustrating a fine structure. In 1916, 
Sommerfeld modified the Bohr atom theory by taking account of the 
relativistic mass change of the electron in an elliptic orbit. He found that 
this introduced a small energy correction, so that levels with the same 
principal quantum number but a different azimuthal quantum number 
have slightly different energies. The corresponding fine structure gave 
good agreement with the observed splitting for hydrogen. 

Even before the development of quantum mechanics it was evident 
that this theory did not explain the spectra of the alkali atoms. An analy- 
sis of the penetration of the single valence electron into the ion core 
shows that the energy levels depend very much on the azimuthal quantum 
number and that Sommerfeld’s relativistic mass effect is small in 
comparison. 

The spectrum lines corresponding to a transition from the level (n, J) 
to the level (n‘, 1’) often show a fine structure which is of the order given 
by Sommerfeld’s correction although his argument cannot apply as any 
splitting of the upper (lower) levels must produce two or more levels 
with the same azimuthal quantum number. The doublet splitting of the 
alkali atoms can be explained on the basis of electron spin and the 
theory is outlined below. The fine structure of hydrogen is really a com- 
bination of spin and relativistic mass change effects. Of course spin itself 
follows from a relativistic formulation of quantum mechanics, 

The energy levels associated with the excitation of the single valence 
electron of an alkali atom can be evaluated using a single-electron model 
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in which the combined effect of the core electrons is represented by a 

non-coloumbic central field V(r). The accidental degeneracy of the levels 

with the same principal quantum number but different / is then removed. 
Ignoring spin, the Hamiltonian 


0 — n? 2 
#9 =-5—V+VO) 


commutes with the orbital angular momentum operators and the energy 
levels have degeneracy (2/ + 1). Each eigenfunction may be specified by 
the three quantum numbers n, 1. m;. On the incorrect assumption that 
the Hamiltonian is unaffected by spin, each of the levels mentioned 
below is now composed of two levels corresponding to m, = +4. The 
degeneracy is doubled to 2(2/ + 1). 

However, the Hamiltonian is really changed on the introduction of 
spin and the 2(2/ + 1) levels are not in fact all degenerate. This effect is 
called ‘spin-orbit’ coupling. 

In a frame of reference that is stationary relative to a nucleus an 
observer will only experience the electric field &. However, in a frame of 
reference moving with an electron orbiting the nucleus, an observer 
would experience a magnetic induction, 


=—VA&4lc? (10.26) 


where V is the orbiting electron velocity and c is the speed of light. 
This is a relativistic effect (Ponofsky and Phillips, p. 284). 

This field interacts with the spin magnetic moment to give a 
contribution 


—4y.B (10.27) 


to the total energy. The factor one-half is the Thomas factor (1926) and 
is due to the fact that the Hamiltonian is expressed in a field of 
reference in which the nucleus is at rest. 

The electric field may be written 


juke 


10.28 
eqdr ( ) 


(take the nucleus at the origin) where V(r) is the central field potential 


energy. 
By combining (10.26), (10.27) and (10.28), it can be seen that the spin 
orbit interaction term to be added to the central field Hamiltonian #° is 
1 dV 
Paine ae { Zs. + Fysy + £,s,}. (10.29) 


This perturbation is a small effect but does give rise to fine structure of 
spectrum lines. 
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The 2 x 2 diagonal matrix #°I representing the original central field 
Hamiltonian commutes with the matrices representing the orbital 
angular momentum operators and the spin operators, and the five 
matrix operators 


HN, LA, GFA, s’*, s, 


form a complete set of commuting operators. For a single electron, s? 
always has the eigenvalues 3h7/4 and so a state is specified completely 
by the four quantum numbers n, J, m,and m,. Alternatively the 
commuting set of matrix operators 


#1 VY, I, GI, ? 


is complete and the quantum numbers n, j, mj and / completely define 
a state. 
The new Hamiltonian, including the spin-orbit coupling 


H1+h 


no longer commutes with the orbital momentum operator #,I and the 
spin operator s, and so m, and m, are no longer good quantum numbers. 
From a symmetry viewpoint, the Hamiltonian is no longer invariant 
under a rotation of space co-ordinates or of spin co-ordinates alone. It is 
however invariant under a simultaneous rotation of both the space and 
spin co-ordinates. 

The new Hamiltonian forms a commuting set with the operators 


VY, J, £71, s?. 


However, whereas the unperturbed energies depend only upon the 
quantum numbers n, / the perturbed energies depend also upon j. Under 
the action of the spin-orbit coupling (10.29), the 2(27 + 1) degenerate 
levels with a given n, J split into a number of distinct levels which differ 
only in their values of /. 

The spin-orbit coupling term can be written (10.29) 


—! Vin gy_e 
Gene ap {I* —-#*I-s*}. (10.30) 
This is strictly a problem involving perturbation of a degenerate level, 
but if the unperturbed wave-functions are specified by n, 7, mj, 7 the 
off-diagonal terms of the perturbation determinant (Chapter 7) are 
already zero and to a first order, the change in energy of a level is the 
expectation value of (10.30), i.e. 


1 f/idv\ 
ate Ge UG +1)—-1@ + 1) —4]n?. (10.31) 
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For a given n and a given / #0, the level splits into a doublet correspond- 
ing to j = +4. The energy separation between the levels is 


h? 1 dV 
SE = aa (21+1) i’ : (10.32) 
n, 


There is no splitting of an atomic level with / = 0 as in this case 7 must 
equal }. 

The sodium D-lines may be explained by this theory and (10.32) 
gives the correct order of magnitude although there is difficulty in 
specifying V(r) exactly. 


Landé g-factor 


An important quantity in the analysis of atomic spectra is the ratio of 
the magnetic moment of an atom to its angular momentum. This is the 
gyromagnetic ratio and it is usually written as 


e 
5 (10.33) 


where |g] is the Landé g-factor. The g-factor for spin is eq ual to 2. 

In Chapter 7, in the discussion of the weak-field Zeeman effect for a 
central-field (spin-less) electron in a uniform external magnetic field B 
in the z-direction, it is shown that the extra terms to be added to the 
Hamiltonian due to B is 


eBhd_e 
ieee as 777 
2m i a6 2m le ep) 
where 7m is the (reduced) electron mass. From (7.77) it is clear that the 
g-factor for orbital momentum is unity. 
When spin is included the field interaction term to be added to the 
Hamiltonian is 


h = (2 +25) = Oe +5). (10.34) 


If the field is weak enough this term is small compared with the spin- 
orbit coupling and may be regarded as a perturbation to the Hamiltonian 
including spin-orbit coupling. The unperturbed eigenfunctions are speci- 
fied by the quantum numbers n, j, mj, The perturbed Hamiltonian does 
not commute with J? and so / is strictly no longer a good quantum 
number and off-diagonal terms do occur in the perturbation determinant. 
However it can be shown (Bethe, p. 114) that the non-zero terms occur 
between states of different 7 and not between states of the same 7 and 
different mj. The off-diagonal terms can be neglected because of the 
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relatively large energy separation between states of different j due to 
spin-orbit coupling. 

The perturbation produces a first-order shift in energy of the state 
(1, j, mj, 1) equal to the expectation value of (10.34). It can be shown 
that (p. 61-64 in Condon and Shortley) 


(s,)=c(Jz) 
where 
ow AAI = 10 +1) +5641) Aas 


270 +1) 
Hence the change in energy of the state (n, j, mj, 1) on introduction of 
the magnetic field is 
e 
1 +c) —— Bmp. : 
(1 +e) om oi (10.36) 


The (27 + 1) states with different values of mj now have different energies 
and the energy change is proportional to the component of total angular 
momentum parallel to the field. 


B=0 B#0 mM; 
vie 
f=Lj= 3 <7 
SSE -4 
~~ -_i 
2 
Spin-orbit 
coupling 
Payoh <a 3 
~ -4 


$=0,j=4 ———-<1----- fenaym 


Figure 10.1 The Zeeman effect. 
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From (10.36) the gyromagnetic ratio is 


e 
1 +c) — 
(tee 


and so the g-factor for the system is (1 +c), i.e. 


iG+1)—-1@d+1)+sCotl) 
+ acres S/o | ea +) (10.37) 


For a single electron s = 5 and for 


g=l 


poltd, gait 


and for 


j=] — =f 
J I-34, & 21+1 


For an s-state / = 0 and g = 2 and the two levels split by 
B 
eh — 
m 


The anomalous Zeeman effect of the alkalis is explained in this way. 


/xs+1) 


Vit! 


A/ii+1) 


The effective average magnetic 
moment is the component # 
along the direction of 

total angular momentum. 

w= B, COS O,;+ pcos 0, 


Figure 10.2 The vector model for the Lande-factor. 
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In a strong magnetic field the interaction term (10.34) may be large 
compared with the spin-orbit coupling. Then the spin-orbit interaction 
is regarded as the perturbation. This is the Paschen-Back effect. 


PROBLEMS 


1 Verify that the matrix representation (10.14) satisfies the commu- 
tation relations (10.4). 
Show that the matrices representing s, and sy have eigenvalues +//2. 
Show further that the matrix representing s? is 


h? [3 0 
4\0 3 
and deduce s? has only one eigenvalue 3h? /4. 


2 Define operators 4,8, y by the relations 
a = 2s,/ih, B = 2s,,/ih = 2s,/ih 
Using (10.14) show that 
a? =g? =y? =— 
and further that 
af = —Ba=¥ 
By=—YBp=a 
ya= —ay=B. 


These results show the relation between the spin operators and 
quaternions. 


3 Consider the Pauli spin matrices (10.16). Verify that 
o2 =o, =a; = +1 

and obtain the three commutation relations of the form 
O,0y — Ga, = 2ia,. 

From these commutation relations show that 
6,G, + aya, =0 

and deduce 
©, Oy =idz. 


Compare these results with the similar results obtained for the matrices 
a,B, y in question two. 
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4 From equations (10.3) and (10.14) deduce that the 2 x 2 matrices 
representing the infinitesimal rotation operators /$*), is) and J$*) in the 
irreducible representation D(!/2) are 


i 0 _ i(° | i(! i 
1 0 i 0 Qo -l1 
respectively. Verify that the commutation relations (10.4) are satisfied. 


Use equation (9.46) to show that the corresponding matrix 
representing the finite rotation P*(¢, z) is 


ele? 0 
QO verfe/2y° 


5 In the isotopic spin theory protons and neutrons are regarded as two 
distinct states of the ‘nucleon’. The two states are defined by the two 
possible values of the isotopic spin co-ordinate 7,. 7, = +1 and 7, = —1 
correspond to the proton and neutron respectively. A nucleon wave- 
function can formally be written 


W(X, Ys 2 Tz) = V+, y, Z)a(T2) + W_(, Y, 2)B(T2) 
with 

a(+1)=1 and a(—1)=0 

B(+1) =0 and B(-1)=1. 


In analogy to the electron spin matrix representation, confirm that 
the matrix representation 


e (a) Oy) hey 


is valid. 
The electron charge of a nucleon is represented by the matrix 


e 
=— [I +,]. 
q=5 +t] 
Confirm that the proton and neutron state vectors are eigenvectors of q 
with eigenvalues +e and 0 respectively. (For further information the 
reader is referred to Heine’s book.) 


6 When a magnetic field of induction B is applied to an electron and if 
all other contributions to the energy are neglected, the Hamiltonian is 


e 
[s,B, +s,By + s,B,]. 
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Show that the eigenvalues of this Hamiltonian are 
fet =p where B=(B2 +B? +B?)'”. 
m 


If Bis at an angle @ to the z-axis then 
B, =B cos 6 and B2 +B} = B? sin? 0. 


Show that in this case if the normalized two component wave vector is 
wi 
then 
(7) 6 
iwa.l? = cos? a and Iw_ |? = sin? > 


7 Suppose, in question 6, the field is along the z-axis, then the 
Hamiltonian is 

eB 

—s,. 

m 
It has been shown in section 10.5 that the eigenvalues of this Hamiltonian 
are E = + enB,/2m corresponding to the z-component spin eigenvalues 
th/2. 

However, if the state function is not an eigenfunction of s, the 

equation that must be satisfied is the Schrodinger time-dependent 
equation. 


ech (1 O\(vs\_ 2 (vs 
in (5 )(¢) acl} 


Verify that if initially at t = 0 the state function is an eigenvector of s, 
with eigenvalue +//2, then at time f, the state function is 


bs -_1 [exp [—iet/2] 
ve V2 | exp [iwt/2] 
where 


ww = eB,/m. 


Confirm also, that at t = 7/2w the state function is an eigenvector of sy 
with eigenvalue A/2 and at t = 27/w the state function is once more an 
eigenvector of s, with eigenvalue f/2. 
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These results show how the spin magnetic moment vector processes 
about the applied magnetic field with angular frequency eB,/m. (This is 
identical to the classical frequency.) 


8 The relativistic change in mass of an electron may be taken into 
account by considering the perturbation 
kh =~ p* /8, Whe C2 


where p is the operator representing linear momentum uy is the reduced 
mass and c is the speed of light. 
Explain why 


: Ze* 2 
pty =4y? le+3 | v) 
TE of 


where y and £ are the eigenfunctions and eigenvalues for a hydrogen-type 
atom. Hence show that the first-order perturbation of the energy due to 
relativity, is for the state (m, J, m;) 


1 3 
Be lars +l 3s| pe®Z*/(4ne9)*h4c?. 


This term should be added to the spin orbit coupling term in (10.31). 
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Systems of Particles 


11.1 Introduction 


The basic theory developed through the postulates in the earlier chapters, 
applies to many particle systems. In Chapter 4 it was shown that the 
Hamiltonian operator representing a system of two particles of masses 
m, and mz at positions (x;, 1,21) and (x2, y2, Z2) respectively is (4.45), 


h?\a? a? — a? h? | a? a? a? 
= — |] —+ + —s + —] — —— !|- + a tS 
2m,|ax? ay? 927] 2m, Lax? ay? az} 
+ V(%1,91521,%2, 2522). 


The state function Y depends upon all six co-ordinates and the time and 
satisfies Schrédinger’s time dependent equation (Postulate 3), 


The normalization condition takes the form 
Jfwrw ae, ay, dzy axe dy dzy = [fvewar, ar, =1 


where the field of integration is over all possible values of the six co- 
ordinates. 

These results are easily extended to a system of ” particles. The 
classical Hamiltonian is 


n p? 
= aa + V(t, 12,-- +I) 


i=1 
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where r; is the position of the ith. The quantum mechanical Hamiltonian 
operator is 


n 
_ nh? 92 a2 a2 
H=- ra teaeaes +V(t1,12,-..5%). (11.1) 


The state function depends upon the values of all the co-ordinates and 
the time and again satisfies the time dependent Schrédinger equation. 

An important Hamiltonian is that of a free atom with n electrons. 
Ignoring the translational energy and the spin-orbit interaction terms, 
this operator is 


i 2 Zr ISS e? 
=. 2 gras een eee 
id 2m 2 > Aneor; se > Aneég|r; —¥;| 
i=1 i= i=1j=1 
(Gi #/) 


(11.2) 


where m is the electron mass and Z is the atomic number. The operator 
V? operates only on the co-ordinates of the ith electron and |r; — ¥j| is 
the separation between the ith and jth electrons. 


11.2 Angular Momentum of Systems 


Orbital angular momentum 
Classically, the orbital angular momentum, of a system of n particles, 
about the origin is 
n 
L= 2 1; A pi 
i= 


where p; is the linear momentum of the ith particle at position r;. The 
quantum mechanical operator representing the z-component of the 
angular momentum of the system is 


224 (nd ng ed Lr (11.3) 


is the operator representing the z-component of the ith particle. Similarly 
it is possible to define the operators YG, and Ly representing the x and 
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components of the orbital angular momentum of the system and the 
quare of the angular momentum of the system is 


L2=L2+L2+L?. 


It is easy to deduce that the angular momentum operators of the 
system satisfy the same commutation rules as those for a single particle, 
thus 


Lx Ly - Ly Lo=ihL, ete. 


(Note that Y,,, Zjy,i# j, etc. commute as they refer to different 
co-ordinates.) Also #? commutes with Y,, Ly and J. 

The angular momentum operators can be related to infinitesimal 
rotation operators. From (11.3) 


n 
L=h 2 IP (11.4) 
i= 
where if ) is the infinitesimal operator that operates on the space co- 
ordinate of the ith particle only. By a similar argument to that given in 
section 10.4 concerning rotation operators and total angular momentum, 
it can be shown that the summation on the right hand side of (11.4) can 
be replaced by a single infinitesimal operator producing a simultaneous 
rotation of the space co-ordinates of all the particles. Consequently, 
orbital angular momentum is conserved if the system is invariant under 
a simultaneous rotation of all the space co-ordinates. This is the case for 
the approximate Hamiltonian (11.2) for a free atom. 

The argument given in section 9.6 implies that F? has a discrete set 
of (2L + 1) eigenfunctions, 2Z an integer, with common eigenvalues 
L(L + 1)h?. Each of these functions may be chosen to be an eigen- 
function of Y, with separate eigenvalues 


M,h=Lh, (L—1)h,...,-Lh 


Single-valuedness of the space wave-function once again requires L to be 
an integer. (Note that Z, M, are used as the quantum numbers for a 
system of particles and J, m; are used for a single particle.) 

The approximate free atom Hamiltonian (11.2) is invariant under a 
simultaneous rotation of all the co-ordinates and so commutes with the 
system angular momentum operators. The quantum number L is a good 
quantum number and may be used in the classification of states. The 
states with L =0,1,2,...are labelled by the letters S, P. D,.... 


Product space 


In a two-particle system where the two particles do not interact, the 
Schrédinger equation is separable and the two-particle wave-function 
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can be expressed as a product of one-particle wave-functions 
Wri, 2) = Wilts W2(r2). 


Products of this type can be used to define a product space. 

The term ‘function space’ was defined in Chapter 3. Suppose the 
linearly independent set of functions {6;},i=1,..., forms the basis 
for an n-dimensional function space and similarly {¢},j=1,...,m 
forms the basis for an m-dimensional space. The n x m linearly indepen- 
dent products 6;9; are the basis functions for an n x m-dimensional space 
which is called the product of the original spaces. Any function in this 
product space can be expressed 


nm 
2D Ai: 
Le | 


where the n x m coefficients Ajj are constants. 


Addition of angular momenta 


Consider a system of two particles and suppose that the operators 
representing the angular momentum components of each particle 
commute with the system Hamiltonian so that the angular momentum of 
each particle is conserved. This implies that the Hamiltonian is invariant 
under a rotation of the co-ordinates of each particle separately. This 
would happen in the case of a two-electron atom where the electron- 
electron interaction term (the last term in (11.2)) is ignored or approxi- 
mated by a smoothed out, separable, central-field potential. 

The operators F?, F,, £7, £3 where F?, F, refer to the angular 
momentum of the system and #?, #3 refer to the separate particles, . 
commute with one another and with the Hamiltonian and may be 
measured simultaneously. Alternatively the operators £?, L,,, £3, 
Lr, where L,,, L2, refer to the separate particles, commute with one 
another and with the Hamiltonian and so their eigenvalues may be known 
simultaneously. Either of these four sets of four angular momentum 
operators, when combined with certain other operators will form a com- 
plete set. Assuming the other operators have fixed eigenvalues the state 
of the system is specified by either the set of quantum numbers (i) Z, M;, 
1,,l, or by Gi) 1,, 771, 14, m2. The quantum numbers L, M, refer to the 
system while /,, m, and /,, mz refer to the first and second particles 
respectively. 

To find the different possible values of FY? associated with the first 
set of operators it is useful to consider the labelling of the different states 
arising from the second set. 

Consider the second set #?, £,,,£%, L,,. Fora given /;, the quan- 
tum number 7m7,, can take (27, + 1) values and similarly for given I, m2 
can take (2/7, + 1) values. So for fixed /, and 1, there are (21, + 1)(Q/2 +1) 
different degenerate states that span a sub-space of the function space of 
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the system. The eigenfunctions may be written W (/,,1,,m,, m2) and the 
largest allowed values for m,, mz are 1,,1, respectively. The space of the 
two-particle system can be expressed as the product of two spaces each 
describing a single particle, i.e. 


Wh, h, m,, m2) > o(1;, m)O(l2, mp) . 


where $(/,, 721) and $(/z, m2) describe the first and second particles 
respectively. 

Consider now the first set of operators £2, Y,, £3, L32. For a fixed 
Z, and /, there must be (2/, + 1)(2/, + 1) different states spanning the 
same sub-space of the system space. These eigenfunctions may be 
written y (LZ, M_, 1,, 12) and are obtained by taking appropriate linear 
combinations of the eigenstates of the second set of operators. The 
coefficients are the Clebsch-Gordon coefficients, i.e. 


VL, M,, li 1) = Ps » ed, Lb, Mi m,|L, Mr) di, lo, m1, m2). 
m,m, 
Since 
f, = Ly, + Lz 


the largest allowed value for M;, is (/, + /,) obtained by combining 
m, =1, and mz = 1). It follows that the largest L is also (1; +12). This 
system wave-function is the product $(J;,1,)@(2, l2). 

There are two states in the second set with the next value for M,. 


M, =(, +h) — 1. 


These occur with m, =1,,mz =1, —1 and m, =1, —1,m2 =1,. By 
suitably combining these there must be two states in the first set with 
this value for M; . These are 


L=(, +h), M, =£-—]1 and L=G, +1, —1), M, =L. 


There are three states in the second set with 
M, = (1, +1,)—2 


and hence three such states in the first set with L =(/, +/,), 7, +1, —1) 
and (J; + J. — 2) respectively. For the next value 


M, =(1, +h) —3 


there are four states and so on. 
The number of states with a given M, does not increase indefinitely 
and in fact the maximum number occurs for 


M, =, —~hl. 


This implies that the minimum value for L is |, — 1,|. 
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Finally, for a given 1, , /) the system quantum number L can take the 
positive values 


L=(, +h) Gtk -1),-.5lh -bl 


(Il, th) SLE —Iy). (11.5) 


This is the triangle inequality. If 7, >/, there are (2/, + 1) distinct values 
for L. For each value of ZL there are (2Z + 1) values of M;, and the total 
number of states is 


(i, +4) 
XY @2L4+1I=2h +Y)Qh+1) 


1 2 


as required. 

The vector coupling rule (11.5) can be used to add three or more 
angular momenta of particles together by successive applications. The 
reader is advised to compare the work above with the discussion of total 
angular momentum in section 10.4. 

There is an equivalent group theoretic way of discussing the addition 
of angular momentum. 


Product representation of a group 


Let the two function spaces {0;} and {¢;} each be invariant under the 
group of operators {A}. The product space is also invariant under the 
same group of operators and the set of functions {6;¢;} forms ann x m- 
dimensional representation of the group. In general the product space 
will be reducible into a sum of sub-spaces each irreducible under the 
group. 

As an illustration consider the two sets of functions {W y»,}/ = 3, 
m, = 3, 2,1,0, -1, —2, —3 and {Wy ,}/= 1, m, = 1,0, —1 which trans- 
form according to the irreducible representations D() and D@) respect- 
ively of the full rotation group. (Note that W 3m, = P3! (cos 0)e!/® etc.) 
The 21-dimensional product space is invariant under the rotation group 
and transforms according to some reducible representation which is 
denoted by D() x D@. The 21 functions in the product space can be 
split up into three sets, one containing nine functions forming the basis 
for the irreducible representation D( another containing seven functions 
and transforming according to D(3) and finally a set of five functions 
transforming according to D2), The result can be written 


D®) x DO) = p@ +p@) +p®), 
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For a more complete discussion the reader is referred to Group Theory 
in Quantum Mechanics, by V. Heine, or Group Theory and Quantum 
Mechanics, by M. Tinkham. 

More generally, if two sets of functions form the basis for irreducible 
representations D“/1) and D(2), the product space transforms according 
to the reducible representation 


Du» x DU») = DM +J2) +Du: tha-I yg _ +Diiidal, (11.6) 


This is of course another statement of the vector coupling rule (11.5). 


Parity 
Consider the inversion operator J which is defined by 
I(ty, 12, -. In) = W(t, ta, -- 5s tn). 


That is, it changes the sign of each of the position variables of all the 
particles in the system. The eigenvalue equation for the inversion 
operator is 


Ty = py 


where p is an eigenvalue and yj is an eigenfunction. If the inversion 
operator is applied twice the function must be converted back into 
itself. The operator /? is equivalent to the identity £ and 


Py=pysy 
p?=1 and p=tl. 
The inversion operator has two eigenvalues p = +1 and the eigen- 
functions are either 
(a) unchanged in sign when acted upon by the inversion operator (even 
functions) or 
(b) changed in sign when acted upon by J (odd functions). 


The ‘state is said to have even or odd parity respectively. 

For some systems the parity eigenvalues are good quantum numbers. 
Consider a closed system of particles with no external forces acting. 
Then the potential energy depends only upon the particle separations and 
the quantum mechanical Hamiltonian is 


ee ae ee 

#=-7 2 5¥i +> Dn 9). (11.7) 
i iF] 

Clearly # commutes with the inversion operator, i.e. 


H1-1H=0. 
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Functions can be chosen which are simultaneously eigenfunctions of both | 
Hand I. 
In a central field there is a term 


2 Vr l) 


to be added to the Hamiltonian above (see (11.2)). This Hamiltonian also 
commutes with J and the parity eigenvalues remain good quantum 
numbers. 

The inversion operator also commutes with the angular momentum 
operators of the system. In particular 


L,I -If,=0 
and 
71-1" =0. 


For a central field problem (or for the Hamiltonian (11.7) the inversion 
operator, the square of the angular momentum, the z-component of the 
angular momentum and the Hamiltonian form a set of commuting 
operators and can be simultaneously measured. 

Consider a single-particle, hydrogen-type wave-function. In spherical 
polar co-ordinates 


Vnim, = Rn(t) PI"! (cos 6)”. 
The inversion operator corresponds to the transformation 
r>r, @>n —8, o>nt?. 
AS 
el (m+0) = (_1)™1 gid 
and 
PI"'(cos (1 — 6)) = P?"1(—cos 6) = (—1)’" ™! P}"!(cos 8), 


the wave-function is multiplied by (—1)! when operated on by the 
inversion operator, i.e. 


Wnim = eS 1) Wty: 


The parity of the state is independent of the quantum number m, and 
depends only upon / If / is an even integer, the state has even parity 
whereas if / is an odd integer the state has odd parity. 

The parity of an n-particle wave-function which can be expressed as 
a product of n single particle central field wave-functions is 


Cay een 


where 7, is the angular momentum quantum number for the ith particle. 
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The parity concept is useful in considering transition probabilities 
and is particularly useful in classifying nuclear energy states (Theoretical 
Nuclear Physics, by Blatt and Weisskopf). 

The conservation of parity in quantum mechanics has no equivalent 
in classical mechanics as the invariance of the classical Hamiltonian under 
inversion does not lead to any interesting results. 


Spin angular momentum 


The operator representing the spin momentum of an electron is defined 
in terms of the infinitesimal rotation operators, which transform the 
spin-co-ordinates, by equation (10.3). The operator representing the z- 
component of the spin momentum of the system of n particles is 


Sz = , Siz (11.8) 


is the operator representing the z-component of the ith particle. Similarly 
it is possible to define the operators S, and S, representing the x and y 
components of the spin momentum of the system and the square of the 
angular momentum of the system is 


Sd, = S2 +S} +8}. 


The spin momentum operators of the system satisfy the same commu- 
tation rules as those for a single particle. 
From (11.8) 


= 76 
S,=h > 12? (11.9) 


i=l 


where Ifs ) is the infinitesimal operator that transforms the spin co-ordinate 
of the ith particle only. The right hand side of (11.9) can be replaced by 
a single infinitesimal operator producing a simultaneous rotation of the 
spin co-ordinates of all the particles. So, spin momentum is conserved if 
the system is invariant under a simultaneous rotation of all the spin 
co-ordinates. This is the case for the approximate free atom Hamiltonian 
(11.2). In fact this Hamiltonian is invariant under a rotation of the ith 
co-ordinate alone and so the spin momentum of each separate particle is 
conserved. 
; The argument given in section 9.6 implies that Sdp has a discrete set 
of (2S + 1) eigenfunctions, 2S an integer, with common eigenvalue 
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S(S + 1)h?. Each of these functions may be chosen to be an eigenfunction 
of S, with eigenvalues 


M,h = Sh, (S—1)h, ..., —Sh. (11.10) 


(The reader is warned not to confuse the operator Sop with the quantum 
number S.) For a single electron S=s =3. 

Consider a system composed of two electrons. If the spin interaction 
terms are ignored, the system Hamiltonian is invariant under a separate 
rotation of each spin co-ordinate and the spin momentum of each elec- 
tron is conserved. For each separate electron s = 4 and by following an 
argument similar to that leading to equation (11.5), the system spin 
quantum number S can take either of the two values 


S=1 with M,=1,0,-1 (triplet) 

S=0 with M,=0 (singlet). 
Alternatively, from (11.6) 

DEP) y pG) = pO + p©, 


The four products 
0(04z)0(022),  B(O1z)B(22),  &(012)B (022), B(O1z)(022) 


where 0), 02, are the spin co-ordinates of the two electrons, form the 
basis for the reducible representation D/2) x D@/2), From these it is 
possible to construct the three functions 


spin quantum number 


s M, 

&(01z)a(022) 1 1 
B(01z)B (022) 1 —1 

1 
V2 [a(01z)B(G2z) + B(O1z)o(922)] 1 0 
which transform according to D™) and the single function 

RY "My, 

SF (e(e12)B (ers) — B(Orz)a(or2)) 0 0 


which transforms according to D(), All of these functions are ortho- 
normal. From the spin operator viewpoint, each of the three functions 
that transforms according to D©) is simultaneously an eigenfunction of 
Sop with eigenvalue 2A? and also an eigenfunction of S, with eigenvalues 
h, 0, —h respectively. The single function that transforms according to 
DO) is an eigenfunction of Sg) and S, with common eigenvalue zero. 
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The reader is advised to verify the results using equations (10.17), 
(10.18) and noting that (from, (11.8)) 


2 =fe2 2 2 2 2 2 
Sop _ (Six tSiy + Siz) + (82 +53y +$3,) 
+ 2(81x52x tSyyS2y +8 12522). 


Total angular momentum 


The total angular momentum of the system is defined to be the sum of 
the orbital angular momentum and the spin angular momentum of the 
system. In particular 


J,= £, +S, (11.11) 
and similarly for J,, and Jy. The square of the total angular momentum is 
J2sJ2+ 52 +5? (11.12) 


and these operators satisfy the usual commutation rules. 
From (11.4), (11.8) and (11.11) 


n n 
J,=h DUP +ID)=An D1? (11.13) 
i=1 i=1 


where i is an infinitesimal operator producing a simultaneous rotation 
of both the space and spin co-ordinates of the ith particle. The right hand 
side of (11.13) can be replaced by a single infinitesimal operator pro- 
ducing a simultaneous rotation of all the co-ordinates of all the particles. 
Consequently, total spin momentum is conserved if the system is 
invariant under a simultaneous rotation of all the co-ordinates, both 
space and spin. This is the case for the general free atom Hamiltonian 
even when spin-orbit coupling is included. 

The argument of section 9.6 implies that J? has a discrete set of 
(2J + 1) eigenfunctions, 2J an integer, with common eigenvalue J(J + 1)h?. 
Each of these may be chosen to be an eigenfunction of J, with separate 
eigenvalues 


Myh = Jh, (I~ 1)h,...,—Ih. (11.14) 


The addition of the total angular momenta of the particles in a system 
follows along the same lines as that leading to equations (11.5) and 
(11.6). 


11.3 Identical Particles 


In quantum mechanics the uncertainty principle makes it impossible to 
define precisely a particle trajectory. Because of this a given particle, ina 
collection of identical particles, cannot be labelled. This is not the case 
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in classical mechanics. A classical particle has a definite trajectory and it 
is possible to distinguish individual particles by following their paths. 

So, in quantum mechanics, unlike classical mechanics, there is, in 
principle, no physical way of distinguishing between identical particles. 
This can be formalized by introducing permutation operators. A permu- 
tation operator Pj; interchanges the labels of the 7 and 7 particles. In 
particular 


Py W(t, 123 82, F2z) = W(t2, 622541, F12) (11.15) 


where r;, 0;, are the space and spin co-ordinates of the ith particle. As 
identical particles are indistinguishable it is not possible to observe any 
change in a system if two such particles are interchanged. This means 
that if particles are relabelled, a normalized function must only change 
by a phase factor, i.e. 


Pip W(t1, 012582, G22) =e W(t1, 0125 Fp, O22). (11.16) 


If the interchange is repeated the original wave-function must be 
obtained and so P;. has two eigenvalues given by (compare the 
inversion operator), 


eit = +1 
i.e. 
ef =41, (11.17) 


The quantum mechanical principle of indistinguishability requires 
that a many-particle wave-function behaves in one of two ways when 
two particle are interchanged (i) either the wave-function is unchanged, 
i.e. symmetric, or (ii) the wave-function changes sign, i.e. antisymmetric. 
For consistency, it is clear that a wave-function describing a system with 
three or more identical particles cannot be symmetric with respect to 
some interchanges and antisymmetric with respect to others. The wave- 
function is either symmetric or antisymmetric with respect to all 
interchanges. 

The Hamiltonian describing a system of identical particles must be 
invariant when the particle co-ordinates including spin are relabelled. 
The permutation operators must commute with the Hamiltonian and 
the permutation quantum numbers (11.17) are good quantum numbers. 
So the wave-function is either permanantly symmetric or it is permanently 
antisymmetric, which of these being determined by the nature of the 
particles rather than the state of the system. 

Particles described by symmetric wave-functions are said to satisfy 
Bose-Einstein statistics whereas particles described by antisymmetric 
wave-functions satisfy Fermi-Dirac statistics. It can be shown from 
relativistic quantum theory that particles with integral spin are Bose- 
Einstein particles (Bosons), and that particles with half-integer spin 
(e.g. electrons, protons, neutrons) obey the Fermi-Dirac statistics. 
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It is easily shown that two Fermi particles cannot occupy the same 
position in space and also have the same spin. Consider a wave-function 
describing a system of Fermi particles. If two of the particles are inter- 
changed the wave-function must change sign. However if these two 
particles have the same set of four co-ordinates (space and spin) then 
the wave-function is clearly invariant. 

This implies that the wave-function vanishes when two particles 
have the same set of co-ordinates indicating that this state does not exist. 
This is one form of the Pauli exclusion principle. 

If the relativistic spin dependent terms in the Hamiltonian are ignored 
the wave-function can be written as a product of a function depending 
on the space co-ordinates multiplied by a function of the spin co- 
ordinates. Consider the case of two Fermi particles. The product wave- 
function is 


W(t1, 0123 F2, F2z) = (84, F2)X (O12, O22). (11.18) 


As the complete wave-function must be antisymmetric in the co-ordinates 
(including spin) then $(r, r2) must be either symmetric in the space 
co-ordinates, in which case x(0}z, 22) is antisymmetric in the spin 
co-ordinates or vice-versa. Then the wave-function (11.18) is a simulta- 
neous eigenfunction of three commuting operators, the Hamiltonian, the 
particle interchange operator and the operator that interchanges the 
spin co-ordinates only. The system spin operators S3 and S, commute 
with the three mentioned above and so the wave-function can also be 
chosen to be a simultaneous eigenfunction of all five operators. These 
wave-functions are the correct zero order functions for a perturbation 
treatment regarding the spin terms as a perturbation. 


Non-interacting particles 


Consider a system composed of n identical particles whose mutual 
interaction can be ignored. The Hamiltonian #of the system is the sum 
of n equal Hamiltonians for the individual particles, i.e. 


with 
h? 
eae V? + V(t, Giz) 


where V? operates on the position co-ordinates of the ith particle at 1. 
The potential V is the same for each particle. 

As the individual particle Hamiltonians all commute with each 
other and with #%, wave-functions can be found that are simultaneously 
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eigenfunctions of #and all the #;. These are the variable separable 
eigenfunctions 


Wri, C123 ¥2, 0223 - «3 Ta» Onz) = 0, (t1,01z)--- On (fins Onz) 
(11.19) 


where 


F,o0,(11, O12) = Eo9,%0;(1, G42) 


and 


i=1 


The subscripts Q;,..., Qn, indicate the completely defined (including 
spin orientation) single particle energy states occupied. If the co- 
ordinates of any two particles are interchanged the wave-function 
(11.19) remains an eigenfunction of the system Hamiltonian with the 
same energy. As there are n! permutations of n objects then n! such 
eigenfunctions can be generated. Note that none of these eigenfunctions 
is in general symmetric or antisymmetric with respect to particle inter- 
changes. However the sum of all ! functions is clearly symmetric. If the 
particles are electrons (i.e. Fermi particles) the many electron wave- 
function must be antisymmetric. Such a wave-function is obtained by 
forming the sum of all wave-functions obtained from (11.19) by an even 
number of particle interchanges and substracting the sum of all wave- 
functions obtained by an odd number of interchanges. This antisymmetric 
wave-function can be expressed as the determinant 


Wri, 125 -- 5 Eas Onz) 
1 | ¢0,(%, O12) $9, (ta, 022)... 99, (fn, Onz) 
vat ¢9,(%1, 012) $9, (t2, O2z)..- 69, (ns Onz)}. (11.20) 


¢0,(ts; O42)... $0,(n» Onz) 


If the single particle wave-functions are normalized (and orthogonal) then 
so is y. Clearly if the co-ordinates of any two particles are interchanged 
the determinental wave-function (11.20) changes spin. (Note a deter- 
minant changes sign when two rows or two columns are interchanged.) 
Note that if two of the occupied one-electron states are the same the 
determinant (11.20) has two rows equal and so vanishes. The system 
Hamiltonian has no solution for which the completely defined one 
electron states are occupied by more than one electron. This is of course 
the Pauli exclusion principle for non-interacting particles and helps to 
explain the periodic system of the elements. At most there can be two 
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| electrons in each hydrogen type atomic orbit and they must have 
opposite spins. 


The Helium atom 


In section 7.2 the ground state of the Helium atom was discussed using 

the variational method. The ground state and lowest excited states will 
now be investigated using first-order perturbation theory. The Hamiltonian 
operator for the Helium atom, ignoring spin dependent terms and the 
translational energy, is 


2 


2e 2e? e? 


nh? 
H = —- — 2 +V2 _ _ 1S, 
2m Vi 2) 4n€ or, 4n€or2 4nég|ry = rz | 


(7.17) 


where the two electrons are at r; and rz respectively. The spin momenta 
operators of each particle and of the system all commute with H# and 

the total spin quantum number can only take the values S = 0 or S= 1 
corresponding to antiparallel and parallel spins respectively (section 11.2). 
It will be shown below that even when spin dependent terms in the 
Hamiltonian are ignored, the allowed energies of the system do depend 
upon the total spin quantum number. This ‘exchange’ effect arises 
because of the use of a determinantal wave-function. 

If the electron-electron term in (7.17) is ignored the exact ground 
state occurs when the two electrons with antiparallel spins occupy the 
hydrogen type (1s) state (Pauli principle). This is the atomic configuration 
(1s)? and the wave-function is 


_1 | bis@pe(oiz) 1s 2)@ (G22) 


bis(ty)B(O1z) O15 (T2)B (O22) 


W(t, 0123 ¥2, O22) = 


= $15 (11) 15 (t2) a [a (012)B (022) 
— B(Gz)a(922)] (11.21) 


where ¢;,(r) is the hydrogen type wave-function 


1 (2\32 soars 
dis) = 4 e os 


The spacial part of the determinantal wave-function is symmetric 
whereas the spin part is antisymmetric. The operators S3p and S, have 
zero eigenvalues as the electrons have antiparallel spins and the orbital 
angular momentum number is L = 0. (Note, the operator representing 
the orbital angular momentum of the system commutes with the 
Hamiltonian (7.17).) All states with a given L and S form a ‘term’. The 
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wave-function (11.21) represents the ‘singlet’ term 1§. The S being the 
spectroscopic notation for a state with Z = 0 and the superscript 
represents the spin degeneracy (2S + 1) of the state. (It is hoped the 
reader will not confuse the two distinct meanings given to S.) 

The nondegenerate determinantal wave-function (11.21) can be used 
to give the energy of the ground state of the Hamiltonian (7.17) correct 
to first order, with the electron-electron interaction term regarded as a 
perturbation, i.e. 


E=2E(\s) 
: 2 
+ [J oiserret a) FT — a Paaltv)ora(t2) dr dra 
where 
nr 2 2 
E(1s)= { eis) |- a Vi - Ze bis (ti) d74 


is the energy of a single electron in the (1s) state assuming no other 
electrons present. The first terms represents the kinetic energy of the two 
electrons together with their potential energy in the field of the nucleus. 
The second term represents the electrostatic interaction between the 
electrons. The value of the ground state energy obtained in this way is 

E = —2:75e?/4n€gao and is not as good as that obtained in section 7.2 
using the variational method. 

The more important excited configurations occur when one of the 
electrons is excited into a hydrogen type level with n = 2 to produce the 
configurations (1s)(2s) or (1s)(2p). It is not difficult to show that matrix 
elements between these two types of excited configurations vanish so 
that the (1s)(2p) states can be dealt with separately. Consider the 
(1s)(2s) configuration. There are four possible determinantal wave- 
functions depending upon the spin orientations of the two electrons. 

By suitably combining two of these determinantal functions, the four 
possible antisymmetric wave-functions can each be expressed as a 
product of a spatial function multiplied by a spin function and can 
be divided into two classes depending upon whether the spin part is 
symmetric or antisymmetric. These are the correct zeroth-order wave- 
functions for a perturbation treatment of the spin-orbit coupling. 

One of the wave-functions has an antisymmetric spin part (and 
hence symmetric spatial part) and corresponds to the singlet term 15 
with L =O and S=0. 


viz 5 [15(13) basta) + Gap(t1)br9CF2)] 


x oA [a1(012)B(022) — B(G12)0( 022). 
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The remaining three wave-functions have a symmetric spin part and 
correspond to the triplet term 3S with L =O and S= 1. 


We “FZ [615(F1) 25(t2) ~ b25(1) O15 @2)] 

x  la(012)B (022) + B(or-)a(%r) 
W3= Fz [$15(t1)25(F2) — Pas (11) O1s(t2)] @(012)0 (022) 
Va= a [G15(t1)@as(t2) — bas(r1)$1s(t2)] B(O1z)B (O22). 


The last three wave-functions all have the same antisymmetric spatial 
part, and vanish if the electrons occupy the same point in space. The 
electrons tend to keep apart in this case and the positive contribution to 
the energy arising from the repulsive electron-electron interaction term 
will be smaller than that arising from the symmetric spatial part of the 
term !S. So, it is to be expected that the perturbed 3S triplet term will 
lie deeper than the !S term. This will be shown to be the case. 

When the electron-electron interaction term is ignored all four states 
(singlet and triplet) are degenerate. Although the unperturbed states are 
degenerate, the perturbation matrix is already diagonal with the above 
choice for the excited wave-functions. To the first order, the perturbed 
energy of the excited singlet state 1S is the expectation value of (7.17) 
using Y, and is 


E('S) =E(s)+E(2s)+J+K (11.22) 


where E(1s) and E(2s) are the energies of a single electron in the (1s) and 
(2s) states respectively and 


* * e 
J= | | bis 1) G25 (t2) 15 (1) $2 (t2) dT, d72 
4ne€g|t; —1;| 
* * e? 
K= [J Oi e)08() PT — 5 Palla) oan(t1) dry dr. 
ey 


J is called the Coulomb integral and represents the average Coulomb 
energy of the two electrons in the (1s) and (2s) states respectively. K is 
called the exchange integral and is a direct consequence of using a deter- 
minantal wave-function instead of a simple product. Unlike J it does not 
have an electrostatic interaction explanation and arises as identical 
particles cannot be distinguished. 
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The perturbed energy of the excited triplet term 3S is 
ECS) =E(1ts)+£(Qs)+J—K. (11.23) 


Note that the triplet degeneracy of this term is not removed by the 
electron-electron interaction. The difference between the 1S and 38 
terms is 2K and as it can be shown that K > 0, the triplet terms lies 
deeper than the singlet term as expected. 


'S 


ap ee IS Perturbed 
levels 


Unperturbed level 
4-fold degenerate. 
E = E(1s)+ E(2s) 


Figure 11.1 Displacement and splitting of the configuration (1s)(2s) by the 
electron-electron interaction, 


Actually this first-order perturbation treatment is not very good as 
the electron-electron interaction term is too large. However, from 
symmetry considerations, the quantum numbers L and S are still good 
in an exact treatment and the 3S term is still triply degenerate. When 
the spin-orbit coupling term is introduced, the Hamiltonian is no longer 
invariant under a rotation of the space or spin co-ordinates separately 
and L and S are no longer good quantum numbers. However, it is 
invariant under a simultaneous rotation of all co-ordinates and the 
states can be classified by the total angular momentum quantum 
number J. 


The hydrogen molecule 


Even though this is the simplest molecule involving more than one 
electron no exact solution exists. The Hamiltonian for the molecule, 
ignoring spin dependent terms and taking the nuclei to be fixed, is 
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where 71,71, ate the distances of the first electron from the protons 
which are at a and } and similarly rg, 72» are the distances of the second 
electron from the protons. The electron separation is 7,2 and the protons 
are a distance R apart. 


we 


as aaa gte 


R 


Figure 11.2 The hydrogen molecule, 


In Chapter 7 the molecular orbital method was applied to the hydrogen 
molecular ion and it was shown that the ground state orbital is the 
gerade, g state 


We (Wy + Wo) 
e VY20 +) 


where /,, W2 are the normalized (1s) wave-functions centred on the two 
protons and A is the overlap integral. This represents a bonding state in 
which electron charge is transferred to the bond. This molecular orbital 
corresponds to a binding energy for H} of ~1-8 eV compared with the 
observed binding energy of 2-8 eV. (Binding energy of a molecule = 
ground state energy of isolated atoms — ground state energy of 
molecule.) 

Clearly this simple model is not very good although it does illustrate 
the bond concept. Electron charge is transferred to the space between 
the atoms and holds them together. Confining an electron in a bond will 
increase its kinetic energy and binding will only result if the decrease in 
potential energy is greater than the increase in kinetic energy. 

The hydrogen ion bond orbital can take up another electron with 
opposite spin and the energy of the electron pair in the bonding orbital 
will be lower than that of the two isolated hydrogen atoms and binding 
energy will result. In this crude model, the two-electron wave-function 
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for the hydrogen molecule is (c.f. (11.21)) 
1 [M_tia(o12) We(t2 a (022) 


Vell > Zz > 2) =e 
sa ek v2 Welty )B(O1z)  We(t2)B(022) 
= Volts) Velt) a [4(012)8(022) 


= B(o,2)a (022)] a. 


The symmetric spatial part may be expanded to give 


VelO)Ve ta) = 3 ay Yaa lta) + Valtsdve) 


+ Wi (ty) Vote) + Wilte) W201). 


This wave-function gives equal weight to the ionic states in which the 
two electrons are on the same atom and the remaining states in which 
each atom has one electron. Electrons will repel each other and this 
suggests that the ionic states should not be as important as this. This 
correlation may be taken into account by configuration interaction in 
which the two-electron wave-function is expressed as a linear combination 
of determinants. 

The lowest determinant that will mix in is that which is obtained by 
exciting both electrons to the antibonding orbital y,,(r). The spin part 
of the two-electron wave-function W,,(r1, 9123 12, 022) is antisymmetric 
as above and the spatial part is 


Wales Wales) = 35 —Zy WaltsdWaCta) + ValteWalt) 


— 010) 0202) — Vil) vor )I. 


By taking a suitable combination of the two determinantal wave- 
functions the ionic terms may be reduced or eliminated. The energy 
may be is minimized in this way and the resulting combination is such 
that ionic terms are relatively unimportant. 

It would appear that a better first wave-function would be to take the 
electrons on different atoms to begin with. This is the basis of the Heitler- 
London approach. There are two possible functions to consider. 


(i) There is the singlet state with symmetric spatial part (S = 0) 


as [Yi tr) Y2(t2) + Volt Wi (2) is 
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(ii) and also the triplet state with antisymmetric spatial part (S = 1) 


yates [vi dry 2(t2) — Yat Wi DI. 3E 


(The 2 denotes zero orbital angular momentum about the molecular 
axis.) 

Detailed analysis indicates (problem 8) that the singlet state lies 
deepest and so represents the ground state. This is the opposite situation 
to that occurring in helium. However it is to be expected that the strongest 
binding for hydrogen will occur when the electrons tend to accumulate 
between the protons. This will occur when the electrons can occupy the 
same space state and hence have antiparallel spins. 

Some electron correlation is already built into this method and the 
energy can be minimized by varying parameters in y, and 2. This 
method is obviously correct in the limit of infinite separation of the 
atoms. 

The Heitler-London wave-function and the best two-determinant 
molecular wave-functions are of equivalent accuracy and both involve 
an error of about 0-5 eV in the calculated binding energy. The observed 
binding energy of the hydrogen molecule is 4-7 eV. For more details the 
reader is referred to the article by Coulson and Lewis in Quantum Theory, 
Vol. II, edited by Bates. 


Rotational spectra of homonuclear molecules 


The previous discussions have concerned the identity of electrons. A 
homonuclear diatomic molecule (e.g. Hz) contains another pair of iden- 
tical particles these being the two nuclei. The complete wave-function 
describing the system of electrons and nuclei must be either symmetric 
or antisymmetric under an interchange of nuclei depending on whether 
they have integral nuclei spin or half-integral nuclear spin respectively. 
The proton is a Fermi particle and so the hydrogen molecular wave- 
function, including nuclear terms, must be antisymmetric under an inter- 
change of the nuclei. The nucleus in (180), has zero spin and so the 
complete wave-function must be symmetric under an interchange of 
nuclei. 

Using the Born-Oppenheimer approximation the molecular wave- 
function may be written 


vy = W(el)(nuc) 
where (el) describes the electron motion relative to fixed nuclei (this is 


the molecular wave-function discussed in the previous section), and y(nuc) 
describes the nuclear motion. 
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1 


Further, the nuclear wave-function may be written (ignoring nuclear- 
spin-molecular coupling) 


W(nuc) = Y(vib)W(rot)Y(sp) 


where (vib) describes the vibrational motion of the nuclei, y(rot) 
describes the rotational motion and y(sp) depends on the nuclear spin. 
The vibrational function depends only upon the separation of the nuclei 
(4.30) and is unchanged by interchanging the nuclei. The rotational 
function can be written (Chapter 4).(c.f. hydrogen atom functions) 


W(rot) =P (cos@)e"" K=0,1,2,.... 


Interchanging the nuclei corresponds to 
6>n-6, o>nt+d 


and from the discussion of parity earlier in this chapter it is clear that 
y(rot) is symmetric (anti-symmetric) if K is even (odd). 

Consider the hydrogen molecule. The protons are Fermi particles 
with spin 4 and the complete wave-function must be antisymmetric 
with respect to interchange of the protons. The analysis given earlier for 
the electron spin functions of helium applies and there is one anti- 
symmetric nuclear spin function (S = 0) and three symmetric spin 
functions (S = 1). The normal ground state of (el) is symmetric with 
respect to proton interchange and because of the nuclear spin degeneracy 
there are three times as many odd rotational states as even. At moderate 
and high temperatures in thermal equilibrium three times as many 
molecules will be in odd as in even rotational states. 

If coupling terms between the nuclear spin and the molecule are 
ignored then nuclear spin is a constant of the motion. In fact these 
coupling terms are extremely weak and so a molecule with anti- 
symmetric spin is very unlikely to undergo a transition to a state with 
symmetric spin. The hydrogen molecule may be regarded as composed 
of two independent parts, one with zero nuclear spin (para-hydrogen) 
and the other with spin one (ortho-hydrogen). At normal temperatures 
hydrogen is one-quarter para and three-quarters ortho-hydrogen. Hence 
in a transition between two electron states the rotational fine structure 
of hydrogen shows a 3: 1 alternation of intensities. The selection rule 
SK = +1 (applicable for 2 electron states) prevents any transitions 
between electronic states with the same symmetry and explains the 
general lack of infra-red absorption spectra for homonuclear diatomic 
molecules. 

In general, for a homonuclear diatomic molecule, where nuclei have 
spin J, there are (( + 1)(27 + 1) symmetric spin states and J(2/ + 1) anti- 
symmetric spin states. The intensities of alternate lines in the rotational 
fine structure will then vary as (J + 1): J. Measurements of rotational 
intensities have been used to determine nuclear spins. 
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For a molecule with zero nuclear spin, the nuclei are bosons and the 
total wave-function must be symmetric under an interchange of nuclei. 
The nuclear spin part of the wave-function is a constant and so if (el) 
is symmetric (antisymmetric) the only allowed states are those with K 
even (odd). In transitions between two electronic states of different 
symmetry, alternate lines in the rotational fine structure will be missing 
because of the selection rule for K. 

If the two nuclei composing the molecule are different isotopes of the 
same element then they are no longer identical and the above discussion 
does not apply. 


11.4 Atomic Energy Levels 


It is very difficult to solve problems involving more than one electron 
and complicated numerical procedures are necessary even in the simplest 
case. The difficulties arise because the electron-electron interaction 
contains terms of the form e?/4mé9|1; — 1; | involving the spatial co- 
ordinates of two electrons. However there are approximate methods for 
reducing the many electron problem to a one-electron self-consistent 
field problem. 


Hartree’s self-consistent field 


Ignoring the translational energy and the spin-orbit interaction terms, the 
Hamiltonian operator for a free atom of atomic number Z with n 
electron is 


n 


Bs el es gt 
Te =m Yee ae aS Z 2, “areal = ae 
4h (11.2) 


The first term represents the kinetic energy of the electrons and the 
second term represents the attraction between the nucleus and the elec- 
trons. The third represents the electron-electron interaction, the sum- 
mation being over all pairs of electrons. 

The Hartree equations may be obtained by assuming a product wave- 
function of the form 


Wri, Fa, -- stn) = $9, (1) 99, (T2) - -- Paq(tn)- (11.24) 


It is essential that the one-electron functions be normalizable. Of course 
(11.24) is not antisymmetric with respect to interchange of any two 
electrons but the single electron states are chosen in agreement with the 
Pauli exclusion principle in that each completely defined occupied state 
occurs once only. 
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The best approximation to the ground state by a solution of the form 
(11.24) is obtained by minimizing the expectation value of the 
Hamiltonian. So, the best Hartree product solution is that which 


minimizes the integral 


i i fvtenn, wees MAW, Fa, -- tn) a7, dr. ... dt, 
(11.25) 


‘subject to the restrictive conditions 


[ $8,(r1)40,(1) dry =1 
The method of Lagrange multipliers may be used. Consider the 
function 


r=]. { 68, (1). -. 03, (tn) 


n 142 e 
| & hi PD mele ca [90.00 --- Bon) an ad 


+ = ri[ [ 45,1) ¢0, 1) dr, — 1} 


J 


i=1,2,..,n. (11.26) 


where 
rn? Ze? 
42-2 yp, 
2m 4n€ of; 


and Aj,j=1,...,n are the Lagrange multipliers. 
Suppose each of the one-electron functions $9,(r;) undergoes a small 
change 5¢9,(1;). For a stationary value of the integral (11.25) subject to 


the conditions (11.26) it is necessary that 
61 =0 
‘ a * * * 
ie (---[ 68,01) --- 43) 65 y(n) 


n 1 non e* 
+| > Mit 22, ea #0, (61) --- 5 Q_(te) 


i=ti 


onlin) dr: «dtm % J...) 95,01).--805, (0) --- 


, I) ee 
tog()| 4; + a ap anegit; —¥| | go, (11)... 
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$0, (tx) --- $0, (tn) IT, -. dT, + oF »| f $6 ,(01)5¢0,(t%) dt, 


+ [sean éa] = 


i 3] a | ob0a Ardo,ted 


Ido, (ri)? x Iog,(e))? 
=> > Ane —{{ Spelt 2 Base dr; dt; + x 


i¥f |r; — ty | 
ij#r 


ef Idg,(x)P 
+ > fe | A ay tee $0, (tx) Tx 
1 


k 4n€o leg —¥| 


+ (a corresponding expression in 5¢9,(r%)) = 0. (11.27) 


560, and 5¢g, may be regarded as independent variations and the 
coeftintente of these in (11.27) must be zero. The coefficient of 80, i is 


2 (e)|2 
4n¢9,(tk) [> 2 -< | soyepl a $0,(tx) — €xbQ, (tx) 


Aneo | te — ty } 


=0 k=1,2,...,” (11.28) 


where 


ae 7 [ee #ibo,e0 dr, 


1 ido, (t) 140; ())? 
7 pp Z {dato eoie)" j — dr; dry ~ 


lr; — ty | 
jek (11.29) 


Note that e; is real and that the coefficient of 5¢g, is simply the 
complex conjugate of (11.28). (Note that the operators 4; are 
Hermitian.) 

In conclusion, the best Hartree product, of the type (11.24), is 
composed of normalized one-electron functions that satisfy the n 
Hartree equations (11.28) withk =1,2,...,n. The one-electron 
function ¢g,(1;) is an eigenfunction of the Hamiltonian 


2 2 2 (r.) (2 
#,=-2-y2- Ze +> e Igo,(r2)| 


2m ' 4neory 1 ANE 9 {ry —f2l saa) 


] 
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with eigenvalue €,. Each of these terms may be given a physical signifi- 
cance. The first term represents the kinetic energy of the kth state 
electron, the second terms represents the interaction between the electron 
and the nucleus and the final term represents the Coulomb energy of the 
electron in the average field produced by all the other electrons. The 
Hamiltonian is not the same for all electrons and the eigenfunctions are 
not orthogonal. 

The set of n non-linear integro-differential equations (11.28) is too 
difficult to solve directly. Hartree (1928) suggested a self-consistent 
approximation. An initial set of one-electron functions is assumed and 
used to obtain the Hamiltonians (11.30). The Hartree equations (11.28) 
are then solved numerically and a new set of one-electron functions | 
obtained. This process is continued until the functions are self-consistent | 
to the required order of accuracy. In practice the third term in the 
Hamiltonian (11.30) is approximated by a spherically symmetric term 
by averaging over all directions and the one-electron functions can then 
be characterized by / and m; quantum numbers. 


The Hartree-Fock self-consistent field 


The variational method used above was suggested independently by Fock 
and Slater in 1930 after Hatree obtained the equation (11.28) directly by 
physical arguments. 

The simple product wave-function (11.24) is not antisymmetric with 
respect to interchange of electrons. The Hartree-Fock method starts by 
assuming the wave-function to be the antisymmetric determinant 


$9, (t1, 12)... $9, (Th, Anz) 


1 | $9,@1, %5) 
WCF, O25 «3 tnOnz) = Q. 1 Is. 


Go,(15%12) PQ (tn One) 
(11.31) 


The expectation value of the Hamiltonian (11.2) obtained using the wave- 
function (11.31), is minimized subject to the condition that each one- 
electron function is normalized. These one-electron functions must be 
linearly independent so that it is possible to form an orthogonal set from 
them (Adams, 1961). 

The set of equations which the spatial parts of the one-electron 
functions satisfy can be written in the ‘standard form’ 


Hb9,(81) = €x¢0,(01) k= 1, 2, re (4 ad 1 32a) 
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where the one-electron Hamiltonians are given by 


2 
A? 5 Ze? ; e? S \ | Ga;(ra)I dr, 


~ 4n€gr,  41€q jr) —rel 
7 = 


e S [ Siedebaenvortenooten 


Ane Ir, — 12146, (11) $0, (1) 


T2- 
(11.32b) 


The first two terms are just the first two terms of Hartree’s- 
Hamiltonian (11.30), The third term is summed over all the ground state 
wave-functions including the kth and represents the Coulomb energy of 
the kth state electron in the average field of all the electrons including 
itself. This Coulomb interaction of an electron with itself is clearly not 
acceptable and must be compensated somehow by the fourth term. This 
new ‘exchange’ term is summed over all wave-functions with spin parallel 
to the kth. In fact if only the element in this summation with j = k is 
used, the Hartree-Fock and Hartree-Hamiltonians are identical. 

Of course, the equations (11.32a) have an infinite number of solutions. 
There are the 7 one-electron wave-functions which are occupied in the 
ground state and also an infinite number of solutions corresponding to 
unoccupied excited states. All the Hartree-Fock solutions are orthogonal 
to each other. This is not the case for the Hartree solutions. 

The total energy of the system is the expectation value of the 
Hamiltonian (11.2) obtained using the determinantal wave-function 
(11.31) and is less than the sum of single particle energies 2 €, as this 
sum counts each electron-electron interaction twice. However the mag- 
nitude of the single particle energies are approximately equal to the 
ionization energies when only one particle is excited. 

It is to be expected that as an electron moves it will repel other 
electrons and a ‘coulomb hole’ will be formed about it. There is corre- 
lation between the motion of the electrons. The Hartree-Fock wave- 
function does take some account of correlation between electrons with 
parallel spins but it ignores correlation between electrons of unlike spin. 
Of course correlation reduces the energy of the system by keeping 
electrons apart and the Hartree-Fock method is a better approximation 
than the Hartree method which completely ignores correlation. The 
difference between the energy of the system obtained using the Hartree 
and the Hartree-Fock methods is called the ‘exchange energy’. The 
reader is warned not to give too much physical significance to this as it is 
simply the difference between two mathematical approximations. The 
Hartree-Fock energy Ey ¢ is an upper bound to the exact ground state 
energy E. The energy difference (E — Fy F.) is called the ‘correlation 
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energy’ and is a significant measure of the accuracy of the Hartree-Fock 
method. The correlation energies for atoms are of the order of a few 
electron volts. 

In practice the Hartree-Fock equations are solved numerically by 
iteration until self-consistent solutions are obtained. 

The infinite set of solutions of the Hartree-Fock equations forms a 
complete set of one-electron functions. The infinite set of n x n deter- 
minantal wave-functions constructed by taking different combinations 
of n Hartree-Fock solutions form a complete set of n-electron anti- 
symmetric wave-functions. The exact solutions of Schrodinger’s many- 
electron equation may be expressed as linear combinations of these 
determinantal wave-functions. However in many cases a single determinant 
is fairly reasonable. 


The periodic table and the shell model 


Usually the coulomb and exchange terms in (11.32) are averaged over all 
directions to make the n spherically symmetric and the one-electron 
functions are then characterized by / and m, angular momentum quantum 
numbers as well as by a principle quantum number n. (The number of 
nodes, excluding the origin, of the radial part of the one-electron function 
is equal ton —1—- 1.) 

The one-electron states are denoted by 


(1s) when n=1, 1=0 
(2s) when n=2, 1=0 
n=2, l=1 


(2p) when 


with the same notation used for hydrogen type states. In this general 
(not coulomb) central field approximation the energy of the ith electron 
depends upon both n; and J;. 

Suppose the Hartree-Fock equations (11.32) are equivalent to a 
variable separable, central field Hamiltonian for the system with the 
form 


n 


7 nw 
KH = 2. [- aa Vit a) (11.33) 


i= 


where u;(r;) are suitable one-electron potentials. (By further approxi- 
mating the exchange terms in (11.32) Slater (1951) has been able to 
show this.) Then the determinental wave-function formed by solutions 
of the Hartree~Fock equations are eigenfunctions of the central field 
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Hamiltonian #9. This Hamiltonian is invariant under separate rotations 
of the co-ordinates of each particle and so commutes with the separate 
particle angular momentum operators. Its eigenvalues depend only upon 
the set of quantum numbers (7;,/;),i= 1,2, ..., associated with the 
one-electron functions in the determinantal eigenfunctions. The energy 
is independent of the quantum numbers m1; and ms; as there are no 
spin-orbit interaction terms. 

This central field ‘atomic’ energy level is called a configuration 
(sometimes ‘configuration’ refers to the set of n associated one-electron 
functions) and its degeneracy is less than or equal to 


2” TI 2h +1) (11.34) 
i=l 


as each one-electron energy has degeneracy 2(2/; + 1). In general the 
Pauli principle prevents the degeneracy (11.34) from being attained and 
also limits the configurations allowed. 

A central field single electron state is completely defined by the set of 
four quantum numbers (n, J, m;, ms). Electrons are Fermi particles and 
the antisymmetry of the many-particle wave-function demands that only 
one electron can occupy a completely defined state. This is the Pauli 
exclusion principle. The combination of the Pauli principle with the 
Hartree-Fock method gives an explanation of the periodic system of 
elements. 

In the ground state the one-electron atomic levels of lowest energy 
are filled up first, the Pauli principle preventing all electrons from 
occupying the lowest (1s) state. The chemical properties are mainly 
determined by the ‘valence’ electrons in the shell (defined by x, /) of 
greatest energy. If the outermost shell is completely filled and there is a 
considerable energy gap to the next empty shell then it is to be expected 
that the element will be inert. The rare gases are typical examples. 

Atomic hydrogen has the configuration (1s) in the ground state. The 
next atom, helium, has both electrons in this shell and the configuration 
is denoted by (1s). This is a closed shell and helium is inert. The next 
element, lithium, has the configuration (1s)?(2s) and has one electron 
outside a closed shell. Such elements with one electron in an s-shell 
(apart from hydrogen) are chemically similar and these are the alkalis. 
Atoms with a full p-shell (or (1s)-shell) are inert and are the rare gases. 

The configuration of atoms with more and more electrons can be 
obtained in this way and imply periodicities in the chemical properties 
of the elements. The number of electrons in a completely filled shell 
depends only upon the /-value and is given by 2(2/ + 1). For thes, p, d 
states the number of electrons is 2, 6, 10, respectively. 

Complications do arise because the one-electron energies depend upon 
both n and 1 For example, the shells (4s) and (3d) have similar energy 
and they may not be filled up in sequence. Another transition period 
occurs because of the similarity in energy of the shells (5s) and (4d). 
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The nuclear shell model is an attempt to explain nuclear properties in | 
a similar manner. An atomic nucleus is composed of protons and neutrons | 
both of which are Fermi particles with spin 4 and with similar masses. 
These nucleons interact through a short range (~107!5 m) attractive 
nuclear force. (The protons of course also repel each other by the long 
range coulomb force.) The shell model assumes that each nucleon moves 
in an averaged field due to the other nucleons. For many nuclei, this 
field can be taken to be spherically symmetric and this means that each 
nucleon state can be classified by (n, J) quantum numbers. However 
spin-orbit coupling is relatively important and the energy levels also 
depend upon, to a large extent upon j. States of similar energy may be 
grouped together and called nucleon shells but the numbers of allowed 
nucleons in the shells are different from the electrons because of the 
strong spin-orbit coupling. The neutrons and the protons form their own 
shells but the energy difference is very small for the lighter nuclei. The 
three lowest shells can contain up to 2, 6 or 12 protons or neutrons 
respectively. A nucleus containing 2, 8 or 20 . . . protons (or neutrons) 
will have completed proton (neutron) shells. These are the ‘magic 
numbers’. A nucleus with completed proton and neutron shells may be 
expected to be stable. The isotope 3He is a typical example. The nuclear 
shell model has proved useful in considering nuclear properties. 


Electrostatic interaction and terms 


The atomic configuration degeneracy described above does not really 
occur as the averaged central field potential used in the Hartree-Fock 
equations (and hence in #0) is only an approximation to the actual 
electrostatic interaction between the electrons. The difference between 
the two Hamiltonians is (from (11.2) and (11.33)) 


n n n 
Ze? 1 e? n 
. tT ————_]- Su) 1.35 
> Aner; 2 > > el Pe ( ) 
i=l i=1 f= q 
Gi #7) 


and may be regarded as a perturbation. This is a degenerate perturbation 
problem and the correct zeroth order functions must be used. The 
functions required are characterized by L, S quantum numbers. This is 
explained below. 

The Hamiltonian (11.2) unlike #% is not invariant under a rotation 
of the spatial co-ordinates of any one particle taken alone and so does 
not commute with the individual electron angular momentum operators. 
The /;,i=1,2,...,n are not good quantum numbers. However this 
Hamiltonian is invariant under a simultaneous rotation of all the spatial 
(or spin) co-ordinates and so commutes with the system angular (and 
spin) momentum operators. The energy of the system depends upon the 
system momentum quantum numbers L and S. The Hamiltonian (11.2) is 
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spin-independent and it may appear that the energy should only depend 
upon L. In fact the energy does also depend upon S, and this is an 
exchange effect. (See discussion of the Helium atom (11.3).) 

The determinantal wave-functions belonging to a given configuration 
form the basis for a reducible representation of the symmetry group of 
the Hamiltonian (11.2). 

Repeated use of the vector coupling rule (11.5) or (11.6) may be used 
to give the possible irreducible representations and hence the allowed 
values of L and S that are included. Once more, application of the Pauli 
principle may exclude some of these values of Z and S. 

By taking suitable combinations of the determinantal wave-functions 
belonging to a configuration it is possible to construct sets of 


QL +1)QS +1) (11.36) 


functions for each allowed pair of values (L, 5) which transform according 
to D() and D(S) under spatial and spin rotations respectively. Each set 
of wave-functions belongs to the same energy level, called a spectral 

term, of the Hamiltonian (11.2) and the degeneracy of the term defined 
by (Z, S) is given by (11.36). 

The wave-functions for a given term (Z, S) do not consist entirely of 
linear combinations of the wave-functions from the corresponding 
configuration. They include small contributions from terms, with the 
same values of Z and S, belonging to different configurations. This effect 
is called configuration:interaction. The energy difference between terms 
associated with a given configuration is small compared with the energy 
difference between configurations. 

As an illustration consider the following examples. The helium atom 
ground state has the configuration with both electrons in the (1s) state. 


(1s). (11.37) 


These electrons have the same values of n and / and are said to be 
equivalent (11.34) gives this configuration a maximum possible degeneracy 
of 2? = 4, However this is clearly incorrect as the Pauli principle requires 
that the two electrons have opposite spins and this configuration is not 
degenerate. The allowed values of L are given by the coupling rule 


D© x p@ =p@ 
i.e. 

L=0. (11.38) 
The allowed values of S are given by 


DOP xy DA) = pM + pO 


S=0,1. (11.39) 
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So it would appear that the eigenfunction (11.37) includes the terms 
tee 8 


This is incorrect since the Pauli principle does not permit the term 3S 
and the configuration corresponds to the single term !S. 
The first excited state of helium has the configuration 


(1s)(2s) 


(11.34) does give the degeneracy correctly as 2? = 4 in this case. The 
allowed values of Z and S are given by (11.38) and (11.39) and both the 
terms !S and 3S are included in the configuration. 

The next excited state of helium has the configuration 


(1s)(2p) 


and (11.34) gives the correct degeneracy 2? x 1 x 3 = 12. The allowed 
values of L are given by 


D© x pM = p® 
i.e. 
L=1 


and S = 0, 1 as before. The allowed terms in the configuration are 1P and 
3P. These terms have degeneracy three and nine respectively making 
twelve wave-functions in all as required. 

These last two configurations each have different terms with the same 
value of L but different values of S. Even though the Hamiltonian (11.2) 
does not depend upon the spin co-ordinates these terms do have different 
energies. This is an exchange effect and arises because determinantal 
wave-functions require antisymmetry in all the co-ordinates including 
spin. 

In the previous discussion of the helium atom it was explained that 
the term, arising from the excited configuration (1s)(2s), in which the 
electrons have parallel spins (S = 1) has a vanishing probability for the 
electrons to be in the same point in space at the same instant. This 
reduces the positive contribution to the energy arising from the electron- 
electron interaction. This result can be generalized to give Hund’s rule. 
The term with the lowest energy obtained from a given configuration is 
that term with the largest value of S and the largest value of L associated 
with that S. 


Fine structure 


The Hamiltonian (11.2) neglects relativistic effects and excludes all 
terms that depend upon the spin co-ordinates. The coupling between the 
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spin and the orbital motion of each electron adds a contribution, 


1 1dV, 
2m? > os {LixSix + Liysiy + Lizsiz} (11.40) 
i=1 ! T 


to the Hamiltonian where -%;, and s;, are the x-components of the 
momenta of the ith-particle. In addition there are interactions between 
the spin of one electron and the orbital motion of the others and also 
interactions between different spins. 

These terms are of the order of (v/c)? and produce the fine structure 
of term energy levels. All spin interaction terms (e.g. (11.40)) involve 
first or second powers of inner products of the vector operators (.77,, 
Liy, Liz) and (S;x, Siy, Siz). These inner products are not invariant 
under a separate rotation of the spatial or spin co-ordinates of all the 
electrons but they, and hence the complete Hamiltonian are invariant 
under a simultaneous rotation of all the co-ordinates both spatial and 
spin. That is, the complete Hamiltonian does not commute with the 
operators representing the system angular momentum or the spin angular 
momentum but does commute with the total angular momentum 
operators. L and S are no longer good quantum numbers but J and My; 
are good quantum numbers. The energy depends only upon J and so each 
energy level has degeneracy (2J + 1). 

For the lighter atoms the spin terms are smaller than the electrostatic 
terms and the spin terms can be regarded as a perturbation of the spectral 
term energy level. The (2L + 1)(2S + 1) degenerate wave-functions 
belonging to a given spectral term (Z, S) span a function space that is 
reducible under the new (reduced) symmetry group of the Hamiltonian 
and so this degeneracy will be split by the spin perturbation to give the 
spectral term ‘fine structure’. This function space can be separated into 
sub-spaces each of which transform irreducibly under the simultaneous 
orbital-spin rotation group according to DY) where J is given by the 
vector coupling rule (11.6). 


J=(L+S),(L+S-—1),...,1L—-SI. (11.41) 


Each of these values of J corresponds to a separate energy level. The 
values of the fine-structure splittings can be obtained from (11.40) by 
calculating the appropriate matrix elements. 

This approximate method for constructing the wave-functions for an 
atomic energy level from a given term is based on the assumption that 
the ‘fine structure’ energy differences are small compared with the 
differences between terms. This is often the case and is referred to as 
Russell-Saunders or LS coupling. 

The exact wave-functions for an atomic level with angular momentum 
quantum numbers (J, M,) will also contain small contributions from 
other terms. 
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For heavier atoms the relativistic spin terms are not small compared 
with the non-central field electrostatic terms and cannot be treated as 
perturbation in this way. A better approximation to the atomic wave- 
functions is obtained by characterizing each electron by its total angular 
momentum j. The non-central field electrostatic terms are then used as a 
perturbation to construct wave-functions with total angular momentum 
J. This scheme is called jj-coupling. Actually this coupling method is 
never completely valid although it may apply for highly excited states. 
It is the opposed limiting case to the LS scheme and the true coupling 
scheme for heavy atoms may lie somewhere in between. 

In addition to the fine-structure splitting due to the electron spin 
there is also a very small splitting of energy levels arising from the weak 
interaction between the spin of the nucleus and the electrons. The 
nuclear spin and the total angular momentum of the electrons can be 
coupled together. This produces the hyperfine structure of the levels and 
the splitting is about 1073 of the fine structure splitting. 


Selection rules 


It can be shown that when the radiation wave-length is large compared 
with atomic dimensions and the electron-dipole approximation is valid, 
the spontaneous and induced transition probabilities are proportional to 
the square of the modulus of the dipole integral. 


fue i | Wy; d7 


y,; and We are the initial and final many-electron states, rz is the position 
of the kth electron and the integration is over all electron co-ordinates. 
(Compare (8.38) for a single electron.) 

2 r; changes sign under an inversion and since the atomic wave- 
functions defined by (J, My) have either positive or negative parity the 
dipole integral (11.42) will vanish unless the transition involves a change 
in parity of the atomic wave-functions. This selection rule is called 
Laporte’s rule. 

There are other selection rules and group theory will now be used to 
determine the non-zero transition probabilities. 

Suppose the function ¢ is one of the basis functions for an irreducible 
representation of a symmetry group. It can be shown that the integral 
over all co-ordinates 


[¢ar=o (11.43) 


if the representation is not the unit representation (see Landau and 
Lifshitz, p. 343). As examples consider the hydrogen type one-electron 
functions. The functions which transform according to the irreducible 


2 
(11.42) 
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representation D) of the rotation group are Wp,, Wp, and wp, and 
clearly 


fue, ar= Juz, dr=|vp, dr=0. 


However the function which transforms according to the unit 
representation D© is y, and 


fu. ar#o. 


Now the three components of Z7_, r, transform amongst themselves 
as a vector under simultaneous rotation of all the electron co-ordinates. 
That is, they transform according to the three-dimensional irreducible 
representation D@), 

The initial state wave-function for the atom W; must transform 
according to some irreducible representation DV) and so the set of all 
functions (remember y; is 2J; + 1 fold degenerate) 


> IW; (11.44) 
k=1 


transform according to the sum of irreducible representations 
=D) 4 pDG4DG-) F,21 
D® x pYdD = DG) 4 pl) J,=h 
=p Ji = 0. 


Of course the final state of the atom must also transform according to 
some irreducible representation DP) and so the integrand of (11.42) 
transforms according to 


D@P x DD x DAD, (11.45) 


This triple product will contain the unit representation only if DUA is 
contained in the product D@ x D@i, That is, if 


Jp = J, +1, 53, ;-1 J; 21 
=4,4 J;=4 
=1 J, = 0. 


The dipole integral (11.42) may be different from zero only for pairs of 
states such that 


8J=+1,-1 
6S =0 (11.46) 
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except the transition J; = 0 to Jy = 0. These are the selection rules for 
allowed transitions. It can be shown that 6M; = +1,0,-—1. 

For lighter atoms in the Russell-Saunders approximation, the spatial 
part of an atomic level wave-function transforms according to D) under 
space co-ordinate rotations and the spin part transforms according to D®) 
under spin rotations. Since Ly=, r, transforms according to D@) and D( 
under space and spin rotations respectively, then it can easily be shown 
that the selection rules for transitions between Russell-Saunders states are 


6L=+1,-1 

6L =0 | 
except the transition 

L;=O0toLf=0 
and 

6S=0. (11.47) 


Observe that the total spin is conserved. 


The sodium D-lines 


In the ground state the sodium atom in the self-consistent field approxi- 
mation has eleven electrons in the configuration 


(1s)?(28)(2p)°(3s). 


The Pauli principle drastically reduces the possible ‘degeneracy’ of this 
configuration. The inner closed core of ten electrons transforms according 
to D©) in both orbital and spin space and the only degeneracy allowed 
occurs from the possible values for the spin quantum number s for the 
outer (3s) electron. The ground state configuration is doubly degenerate 
and is represented by the doublet term L = 0, S = 4, ie. 2S. The spin- 
orbit coupling does not reduce this degeneracy and the atomic level may 
be written 


2Sy2siies = LL =O, S=4, J=H. 


The subscript gives the J-value. 

The sodium atom may be excited into a higher energy perhaps by 
applying a suitable field or by heating it in an electric arc. The lowest 
excited configuration is 


(1s)? (2s)*(2p)° (3p). 


There are six such ‘degenerate’ states corresponding to the three 
different values of / and the two different values of s taken by the outer 
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(3p) electron. For the complete configuration LZ =1,S= 4 and so it 
contains the single term 2P. The allowed values of the total angular 
momentum of the system are 


J=(1 +4), (1-4) 

ie. 
J =3,4. 

Consequently, the term ?P splits up into the separate energy levels 
*P3/2, *P1y2 


under the spin-orbit interaction. 


(18)?(23)?(2p)°3s) 
Sy 


Figure 11.3 The sodium D-4ine transition. 


The downward-transition from either of the levels *P3/2 or 7P4/2 to 
the level 2S/2 is possible as 


6L=-1, 5S = 0. 


These two transitions are the cause of the D-line doublet in the sodium 
spectrum and correspond to wave-lengths 5.889-96 A and 5895-93 A 
respectively. 


11.5 Second Quantization 


The method of second quantization is very useful for studying systems 
of identical particles. It is necessary in the quantization of fields where 
the number of particles may actually change. For example, photons may 
be created or annihilated in a radiation field. However this work is out- 
side the scope of an introductory text and the second quantization 
scheme will only be applied to electron systems below. 
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Let {¢9,(x)} be a complete set (infinite) of orthonormal one-particle 
functions. x denotes the set of four co-ordinates including the spin- 
orientation of the particle. This set may be combined to form anti- 
symmetric (or symmetric) n-particle wave-functions. For electrons 
(i.e. Fermi particles the n-particle wave-function is anti-symmetry. Let 
B(")(x1,x2,..., X,) be such an anti-symmetric n-particle function 
constructed from the first n functions in the complete set. Then 


9, (x1)... $9, &n) 


eyecare 
9Qn(%1) -- - $y (%n) 


(x1, x2,.. (11.48) 


Similarly, other n-particle determinantal wave-functions can be con- 
structed using other sets of n single-particle functions. In this way a 
complete set of determinantal wave-functions is obtained and an arbitrary 
n-particle wave-functions can be expanded in terms of these n x n 
determinantal wave-functions. 

Each of the determinantal wave-functions is completely specified 
when the number of particles in each single-particle state is specified. 

Let n; be the number of particles in the state ¢g,(x). The wave-function 
(11.48) corresponds to 


ny =n, =...=n, = 1, Mn+, =Nn+2 =...70. (11.49) 


The ‘occupation numbers’ n; can only take the values 0 or 1 for Fermi 
particles (Pauli principle). This restriction does not apply to particles 
obeying Bose-Einstein statistics. 

The n-particle determinantal wave-functions will now be regarded as 
functions of the occupation numbers of the states $9;({x) rather than as 
functions of the particle co-ordinates. With this in mind the wave-functions 
will be written 


O(1,,N2,N3,...) (11.50) 


where 


(11.51) 


Ms 
= 
{il 
= 


and for Fermi particles n; = 0, 1. 
The wave-function (11.48) is written 


@(1,1,1,...,1,0,0,...). 
— 


n 
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In this new scheme the operators representing dynamical variables 
must act on the occupation numbers rather than on the particle co- 
ordinates. At this point it is necessary to introduce creation and annihil- 
ation operators. The creation operator a} operates on an-electron wave- 
function such as (11.50) to add a new particle in the state $9; (x), ie. 


af B(ny,N2,....Ni,--.)= (LI) —1j)OMy, Ne, ....m: +1,...) 
(11.52) 


where 
i-1 
$8; 7 2 Nks (11.53) 
k=1 


§; is equal to the number of occupied states up to but excluding ¢g; (x). 
(—1)* is a phase factor, the significance of which will become apparent 
later. Observe that if n; = 1, the right hand side of (11.52) is zero in 
agreement with the Pauli principle. A determinantal wave-function with 
two particles in the same state is identically zero. 

An annihilation operator a; is defined to remove a particle from the 
state $9, (x) 


a, P(ny, No, wee Me. .) = (—1'ing® (ny, no, oy Qt — | ee .) 
(11.54) 
(aj is the Hermitian adjoint of a;). Similar but different expressions 
define the action of the creation and annihilation operators if the 
particles are bosons. 
As n; can only equal one or zero for Fermi particles then from (11.52) 

aja; =0 (11.55) 
and from (11.54) 

a;Q; =0. (11.56) 


The vacuum state is defined to be that state containing no particles. 
Hence 


a} Pyac = O(n = 1,02 =0,...) = o9,(%) 


represents the creation of a single particle in the state ¢g9 (x). A general 
determinantal wave-function can be constructed from the vacuum state. 


O(ny,n2,...5Nj,.-) = @D™@Y? ... GP... Pygee (11.57) 
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Changing the order of the creation operators in a product corresponds 
to interchanging the particles. When two Fermi particles are interchanged 
the wave-function changes sign. Hence it is necessary that 


(aja;)® = —(@ja;)® (11.58) 


for arbitrary ®, i.e. 
(a7, a7]. =0 (11.59) 


where [a, 8] + =a + Ba is the anticommunicator of a and 8. It is to 
satisfy this requirement that the phases are chosen as in (11.52) and 
(11.54). To see this consider 


aja; P(m,,...,n;=0,...,0; =0,...) 
=(-1)%a7O(n,,.. .. 0; 


=(-1)8(-1)O@,....2;21,..,m21,..) f>i 


fl 
ad 
x 
u 
= 
— 


Also 
aja; P(ny,...,n; =0,...,n;=0,...) 
=(-1)%a/O(m,...,m; =1,...,2; = 0) 


=(-I¥C1I)VO(,....0,=1,..,.21,...) F>i 
(11.61) 


Since s; = 5; + 1 then (11.58) is indeed satisfied. 
The annihilation operators satisfy a relationship similar to (11.59) 


[a;,a;], =0. (11.62) 


There are also commutation relations which involve both creation 
and annihilation operators. Consider the action of the operator aja; on 
the determinantal wave-function ®(7,,...,7;,...) 


a, P(ny,...,nj,...) =(—1)*inj O(y,...,2; —1,...) 
aja;P(ny,...,N;,..-) =(—1)%ing(-1) 7 O(1y, .. .. 1;, .- ) 
aja;P(n1,...,.N;,...) =m, ....2j,--) 

as n; = 0,1 for Fermi particles. The operator 
N; = 474; (11.63) 


is the number operator for the state ¢g, (x). Its eigenvalues are the 
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number of particles in this state. The total number operator is defined 
to be 


N= N;,. (11.64) 


i=l 
Consider now the action of the operator ajaj 

ay ®(ny,...,n;,..) =I —n)O(,....2;4+1,..) 

aja; ®(n,,...,nj,...)=(-1)%C —n,; )(-1)#(; + 1) 

x O(1,,...,7;,.-.) 

i.e. 

aja; P(ny,...,nj,...) = (1 —nj) (ny, ...,1j,..-)- (11.65) 
From (11.62) and (11.65) 

(aja; + ajay )P(ny,...,nj,.-.)=P(1y,...Mi,-.-). 
This is a special case of the commutation rule 

aja; +aj;aj =1 
or generally 

{aj,, a;)+ =5y. (11.66) 


From the definition (11.63) and the commutation rules (11.59), 
(11.62) and (11.66), it is quite easy to show that the number of operators 
belonging to different states commute, i.e. 


[N;,.Nj] =0. (11.67) 


Consequently functions can be found that are simultaneously eigen- 
functions of all these number operators. These are of course the 
determinantal wave-functions. 

Finally it can be shown that 


[N,,4;] = -8 4; (11.68) 
and 


(Ni. aj] = 8 a7. (11.69) 


Representation of observables 


A physical observable is represented in quantum mechanics by an 
operator. Let a(x) be the ‘co-ordinate’ operator that represents the 
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observable A. As the set {¢g,(x)} is complete the effect of a(x) on the 
single particle function $9, (x) is 


co 


agg,(x) = 2 aj; 69;(x) (11.70) 
‘5 
with 
Oyj = { oS;a09, dry. 


(The integration includes a sum over spin co-ordinates.) The right hand 
side of (11.70) can be written in terms of creation and annihilation 
Operators. 

2 a4; 2,99, (x) = ( 


2 aia] $9; (x) 


[J 
j j=Lk=l 


-( 2 2 aon a) G7 ®yac). (11.71) 
J= = 


From (11.70) and (11.71) it is seen that the operator a(x) which acts on 
functions of the co-ordinates x can be expressed as an operator acting on 
the occupation numbers n;. 


a(x) = 2: Daj on ay. (11.72) 
i 


This result has been shown for the simplest case of a one-electron 
function but it can be extended. 


Let @(x,, X2,..., X,) be an additive operator for a system of n 
particles such that 
(Ky, X2,-- +) Xq) =a(xz) ta(xg) +... ta(x,). (11.73) 


This operator is symmetrical with respect to all the particles and it can 
be shown that 


(X,,X2,.. Xn) = 2 Dai aig dy (11.74) 
J 


with aj, = f $9;0(x)OQ, Ty as before. 

As an example consider a system of n identical particles whose mutual 
interaction can be ignored. The system Hamiltonian is the additive 
operator 


H= 5 H, (11.75) 


r=t 


with 
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The simplest non-trivial case is a system of two particles. The wave- 
function for the system can be expressed as a linear sum of 2-particle 
determinantal wave-functions of the form 


++ 1 
@] Bm Pyac = Va [$0;(X1)$0m(X2) — $0 m(X1)$0;(x2)] 


where the phase of the determinant has been sensibly chosen. 
The effect of the system Hamiltonian on this determinant is 


[0 +] a [.0;(%1) bm (X2) ~ $n (1) $0; (2 )] 


1 
= Vi tnt 2 4160; (X1) — 6Q;(X2) 2 £jmQ; (x1) 


+ ¢9,(X1) 2 4 jm $Q;(K2) — 60m (X1) 2 £j19Q;(X2)| (11.76) 
as 


H,$Q,(Xr) = 2 £199; (%) for r= 1, 2, 
The right hand side of (11.76) is 


Shi a [62m X2}40; (1) — b2m(X1)$0) (2) I 


— E fim Fe (—bor(%s}894(%2) + 624(82)40) (61) 
In terms of the creation and annihilation operators this becomes 
214 10am Prac — him; a Pyacl- (11.77) 
It is quite easy to show that (11.77) is equal to 
2 2 aj hjx@x [a1 2mP vac] 


and hence deduce that 
[A + #2) = 2 Daj hiner 
i 
as required. For a system of n non-interacting particles, the Hamiltonian 


in the occupation number representation is 


22.07 hina. (11.78) 
I 
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Suppose the expansion set {¢gQ;(x)} is chosen to be the set of eigen- 
functions of the one-electron Hamiltonians so that 


KH, 00; (x,) = EQ; $9; (%,) 


and 

Aix = EQ; Six. (11.79) 
The Hamiltonian (11.78) becomes 

LE opal 44 = ZEON (11.80) 


and its eigenfunctions are also the simultaneous eigenfunctions of the 
number operator N;. The eigenvalues are 


2EQ)7 n;=0,1 and 2 j= 7. (11.81) 
Unfortunately the operators in many particle systems are not always 


additive in the sense of (11.73). The particles often interact with each 
other and co-ordinate operators occur which can be written 


B(x, X2; sey Xn) = 226%, Xs), (11.82) 
where B(x; , Xs) operates on functions involving x, or Xs. It can be shown 
- that 
B(X4, X2,. "9 Xn) =4 L2ZzZ Ser aiaj aay, (11.83) 
ij 
with 


Bi = ff $9;(x1) 99, (X2)B(K1, X2)$0;,(%1) 99; (%2) d7x, dT, 


For example, the Coulomb interactions between electrons adds a term 


e2 


22 V(t, —1,) where V(t, — ts) = 4neg\t, — gl 


to the n-particle Hamiltonian (11.75). 1, is the spatial part of x,. The 
Hamiltonian in the occupation number representation is then 


22 aj kindy +¥ 22 Zaia'V aa, (11.84) 
j ij 


where hj, is defined as before and 


v4, = mal } 65 (416502) dag 1)40)(82) dr, dt, 
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Electron field operators 


The rules for finding the form of the operators, representing observables, 
in the occupation number representation can be stated in terms of new 
‘electron field’ operators &*(x) and V(x) defined by 


WWQ=2¢9,Way WK) = 2 9,4. (11.85) 


The operator ¥*(xo) creates a particle with the co-ordinates 
(including spin) xo. 

It is quite easy to show that these operators satisfy the commutation 
relations 


[¥(x), ¥x')]. =0 

[¥), WK), = 5 (x — x’) (11.86) 
where 

8(x —x')=8(x — x)8(y —y')8(Z — z')do,3. 


These operators are of interest because the equations (11.74) and 
(11.83) can be written 


(X15 XD, - Xn) = | WRa(K) VOW) dre (11.87) 


BOR, x2, x =F [| WAY O)BER, x’'UO’) HO) drydr! 
(11.88) 


respectively. The reader is asked to confirm that the right hand sides of 
(11.87) and (11.88) agree with the right hand sides of (11.74) and 
(11.83) by substituting (11.85) for the electron field operators. 

The electron field operators play a similar role to that of the wave- 
function in the Schrédinger theory. In the second quantization formalism 
wave-functions are in some sense replaced by operators. 


Bosons 


The work above has been specifically oriented towards Fermi particles. 
There are differences in the formalism when applied to bosons. The n- 
particle wave-functions ®(7,, 2, . . .) are now symmetric in interchanging 
any two particles and the occupation numbers can exceed unity. 

Creation and annihilation operators are defined by 


af @®(ny,n2,...,nj,...) = Vnjp +1 O(n, n2,...,.n, +1,...) 
a;P(n4,No,...,1j,...)= Vny Oy, M2,....n;—1,...) 


| 
302 A FIRST COURSE IN QUANTUM MECHANICS | 


respectively. The commutation relations are also different, e.g. 
a7, a7] =0. (11.89) 


This development is outside the scope of this text although it is 
certainly important in many problems, such as in a discussion of photons 
in an electromagnetic field or in the theory of phonons in lattice 
vibrations. 

The interested reader is referred to Introductory Quantum Electro- 
dynamics, by Power, and to Concepts in Solids, by Anderson. The latter 
text has an interesting derivation of the Hartree-Fock equations in second 
quantization formalism (p. 15). 


PROBLEMS 
1 Use the vector coupling rule to show that 

(a) D® x pG/) = pCVD 4 pG/2) 

(b) D x D x DM =D® 42p@ + 3p + pO 

(c) D|/?) x DG?) xy pW/2) = pGF/ 4 2pG/2), 

2 Show that the inversion operator is Hermitian. 


3 The electric dipole moment of a system of n particles each of charge 
e can be written 


n 
Pre D> 
k=1 


where r;, is the position vector of the kth particle. Explain why P 
changes sign under the action of the inversion operator. 

Deduce that if the state function (1, rz, .. ., f,) has either even or 
odd parity then the expectation value of the dipole moment is zero, ie. 


[vePy dry ...dr_ =0. 


This show that in the dipole approximation transitions are allowed only 
between even and odd states. 


4 Consider a tightly bound collection of particles which behaves as a 
single particle at a point r with S = 1. The wave-functions may be written 
as a column matrix 


V4 Ct) 
YVOEO= | yon 
v_(r, ¢) 


and m, can take the values +1,0, —1. 
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Verify that the appropriate commutation rules are satisfied by the 
matrix representation 


0 1 0 0 -i O 
S e 1 0 | S f 0 
ars =—= [7 -i 
. 2 2: V2 

01 0 0 i O 

10 0 
S,=nr 10 0 O 

00 -l1, 


What are the eigenvectors corresponding to ms = 1,0, —1 respectively? 
Show that the matrix representing See is equal to 2h? times the 3 x 3 
unit matrix as required. 


5 Consider a system composed of three electrons. If the spin interaction 
terms are ignored, the system Hamiltonian is invariant under a separate 
rotation of each spin co-ordinate and the spin momentum of each electron 
is conserved. For each electron s = 4. Using part (c) of problem (1), or 
otherwise, deduce that there are a quartet of system states with S =3 and 
M, = 3,4, —4, —3 and two separate doublets with S = 4 and M, = 4, —4. 

Obtain the values of S and M, for each of the following spin functions 
and confirm that the quartet is 


a (o1z)@ (02,)a (3) 

a [a1(04,) (092) 8(032) + a(012)8(02z)0(032) 
+ B(o,z)0 (022 )a(03z)] 

WA [8(04_)B(02,)0(03¢) + B(01z)0(022)B (032) 
+ 0&(012)8(02z)B(63z)) 


B(042)B(022)B(93z) 
and that the two doublets may be written 


a falas aosdiGsp) #a0,) bases) 
= 26(0;,)a(o2,)a(o3,)] 

a {8 (012)B(022)a(032) + B(04z)0(022)8 (02) 
oy 20(012)B(022)B(63z)] 
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and 


ae 


V2 (2 (012 )a (022) B (032) — &(012)8 (G22 )a (63,)] 


Fz [8(02)B(022)0(03z) — B(012)4(024)8(0,)]. 


Observe that the first doublet is symmetric in the interchange of the 
second and third particles whereas the second doublet is antisymmetric 
in these particles. 


6 Consider a linear box of unit length bounded by walls of infinite 
potential. If the origin is taken at one end of the box confirm that the 
wave-functions for an electron inside the box is 


Wa(x)= V2sin(nmx)  n=1,2,.... 


Write down the spatial part of the determinantal wave-function (x , x2) 
describing two electrons, with identical spin, in the two lowest energy 
states. If one electron is at the point x = 4, the probability distribution 
of the other is 


W*@,x2)WG, x2). 
Show that this is equal to 
2 


1 
2 [n 1X2 —~——= sin 21x 
V2 


Sketch this function and explain how it illustrates ‘spin correlation’. 
Compare this with the distribution obtained from a simple product 
wave-function. 


7 Show that the expectation value of the Hamiltonian (11.2) obtained 
using the Hartree product (11.24) is 


Zee EE [J 160;(6)P one) ar dry. 


nae See 
An€o | tj — rel 


This shows that the energy of the system is less than the sum of the single 
particle energies. 
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8 In the approximation that the nuclei are fixed (Born-Oppenheimer) 
the Hamiltonian for the hydrogen molecule is, ignoring spin terms, 


2 2 2 2 2 2 


4m€9 \Pia Tin 12a 2b 


+ 
An€oriq 41€oR 


where r1q,/1p are the distances of the first electron from the protons 
which are at ‘a’ and ‘b’ and similarly r2g,r2» are the distances of the 
second electron from the protons. The separation of the electrons is 712 
and the protons are a distance R apart. 

Let ¢,(r), ¢, (r) be the real normalized hydrogen (1s) wave-functions 
centred on the protons at ‘a’ and ‘b’ respectively. A first approximation 
to the ground state wave-function for the molecule will be obtained by 
considering the four normalized determinantal wave-functions of the type 


1 Ga(t1)&(012) ba(t2)a (022) 
V2 =A?) | oy(ry)a(o12) do(t2)a(022) 


WM, 425 ¥2, 022) = 


where 


A= | b0(t)¢5(0) dr. 


Show that from these it is possible to construct the singlet state with 
symmetric spatial part 


1 
W(t, ¥2) ~ Vad +2) [a(t 1) $5 (82) + ba(F2)bp(11)1 


and a triplet state with antisymmetric spatial part 


Valte ta) = Fae [ta (e)8o (2) ~ data) 


Show that the energies of the system corresponding to the singlet and 
triplet states are respectively 


J K 
= +—_ + ——— 
E, 2E (1s) 1 a4 2 


J K 
feng 
BE) Naa ae 
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where E (1s) is the energy of the hydrogen (1s) state and 


( ba (11) $5 (r2) - a eens + a + ;| a(11)%, (r2) 


4n€o Tq 


x te dro, 


| exarexcy |. es a -+4| LEDKED 


xdr, dT>. 


Show that 


ieee. [22 an ef [ee a Ge 
4néq Tib " dneo - 


e 


+ 
4ne€oR 


and that 
k= — 228 [ eedooley) Ga(ti)oo(t1) 4. —{ ba(t1) bp (82) 5 (11)ba(F2) 
"an TE GQ 


An eo Tip 12 
22 
4negR { 


The first term in K is dominant and so K < 0. Explain why this means 
that the singlet state lies deeper than the triplet state. (Compare this 
result with that obtained for the helium atom.) 


xdt, dt, + 


9 Consider two particles each of mass m at positions r;, rz respectively 
which interact with one another through a potential V(|r; — r2|), which 
depends only upon their separation. The time-independent Schrédinger 
equation for the system is 


h? 
rors) +V3)Q + Vir, — 12) 2 =F 


where §2(r1, rz) is the spatial part of the state function. Following the 
argument from equations (4.45) to (4.51) show that the relative motion 
of the particles is described by the equation 


2 
1a r? oy pena eer oty + 1 <2 Si Py as 
r?ar\ dr} r*sin?@ a¢2 rr? sind a0 36 
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where p = m/2 andr, 0, ¢ are the polar co-ordinates of the second particle 
relative to the first. (Note r= |r, —1r2|.) £, is the ‘internal’ energy of 
the system. 

Show that if 


Wr, 8 6) = R(r) (4) B(9) 
it is necessary that 


d@ 
ce 

u @+rA0=0 
sin 8 7 ~ sin? 75 

ld dR iN 
alr aia ra a a 


where m, are separation constants. 
Following the argument leading to equation (4.59) show that fora 
physically acceptable solution 


aol $)> - ws FE | Reo i) 


ry? dr 
with 
12> |ml 


and m is an integer. 
10 In question 9 assume the ‘Hooke’s law’ potential 
k 2 
Vir)= 5 (r—7o)° + Vo 
where Vo, k, ro are constants. In equation (i) substitute 


R==0 


and p=r—Tro 
and hence show that for those states with 7 =0 
—— 5 += p?S=(E, — Vo)S. 


Deduce that the allowed vibrational energies are 


K 
E=Vo+(n+ Hn |= n=0,1;2,... 
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These energy levels are an approximation to the vibrational energy 
levels of a diatomic molecule. (The rotational levels are given by (4.60).) 
A better approximation would be obtained by assuming the Morse 
potential 


V(r) = Vole 20-70 /4 _ 2 ef -rod/a} 
where Vo, ro, 2 are constants. 
11 The spin functions a(o,) and B(o,) may be both chosen to have 
even parity p = +1 (see Heine’s book). Deduce that a configurational 
wave-function (including spin) has the parity (~1)*¥#, where J; is the 
angular momentum of the ith particle. The complete atomic Hamiltonian 
including spin terms is invariant under the inversion operator. Use 
perturbation theory to deduce that configuration interaction only takes 
place between configurations with the same parity. 
12 Show that following configurations include only those terms given. 
(a) (2s)? Ig 
(b) (2p)* 4S, 2D, ?P 
(c) (3d)? 3p 3p Ig In Ig 
13 Explain why the eigenfunctions belonging to the term (ZL, S) can be 
characterized by the four quantum numbers L, S, J, My. 


The spin-orbit coupling term in the many-electron Hamiltonian can 
be written (see Heine’s book). 


A(L, S)[Lesz + LySy + L,57] 


where #,, 5, etc. are the system momentum operators. Deduce that the 
energy of the wave-function defined by L, S, J, M, is 


Ey t+4A(L, S\[J(VJ +1) —-L(L +1) —S(S + 1)]h? 


where E'g is the term energy ignoring the spin-orbit coupling. (Hint: 
Compare the fine-structure theory leading to equation (10.31).) 

Show that the energy difference between fine structure components 
with 5J = 1 is equal to A(L, S)Jh?. This is Landé’s interval rule. 


14 Confirm the commutation rules (11.67), (11.68) and (11.69). 
15 Use the closure relation 
2 $0;(x)b9;(xo) = 5(K — Xo) 


to show that the field operator ¥*(xo) creates a particle with defined 
values of the co-ordinates (including spin) xo. 
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16 Show that the right hand sides of (11.87) and (11.88) agree with 
the right hand sides of (11.74) and (11.83) respectively. 


17 Obtain the commutation rule (11.89) for Bosons. 
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CHAPTER 12 | 


The Dirac Equation 


12.1 Introduction 


The quantum mechanics described in the previous chapters is essentially 
non-relativistic and has been obtained by quantizing the appropriate 
classical expressions. It is necessary t') develop the quantum theory to 
obtain a description that is valid for particles with speeds approaching 
that of light. The present chapter deals only with the relativistic theory 
for a single particle. A satisfactory many-particle theory requires quantum 
field operators and is beyond the scope of this book. 

A useful feature of the relativistic equations is that particle spin is 
built into the theory and does not have to be added in the way that 
Pauli added the spin concept to Schrédinger’s theory. 


12.2 The Lorentz Transformation 


According to the principle of special relativity, the velocity of light as 
measured by an observer is independent of the motion of the observer. 
Consider two inertial co-ordinate frames of reference S and S’ such that 
S' moves relatively to S at the constant speed v along the positive x-axis. 
The x'-axis coincides with the x-axis. (x, y, z, t) and (x’, y’, z’, t) are the 
space and time co-ordinates referred to the reference frames S and S’ 
respectively. 

Suppose that at the instant ¢ = 0 a spherical electromagnetic wave 
leaves the origin of S which at that moment coincides with the origin of 
S'. The speed of propagation of the wave front is isotropic and equal to 
the same constant value c in both reference frames and so 


x? ty? +22 — ¢7t? =0 =x’? +? +2"? — 21? (12.1) 
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y 
AY 


10) x 


Figure 12.1 


The transformation from (x, y, z, f) to (x’, y’, z', t’) that satisfies 
(12.1) is 


x'=B(x-vt), y'=y, 2'=z,  t'=8 (: = =) (12.2) 
with 


2 
v 
5-1 eee 


Such a transformation is called a Lorentz transformation. Strictly this is 
a proper Lorentz transformation as it does not involve an inversion in 
space or time. When v <c, (12.2) reduces to the classical Galilean 
transformation 


Uy 


x'=x— vt, y'=y, 2’ =z, t=. (12.3) 


The transformation (12.2) can be inverted to give 
x=B(x' tur),  ysy',z=z', ra (e +z). (12.4) 


A rotation of axis in three-dimensional space leaves the quantity 
x? +y? +z? invariant. If a co-ordinate T is defined by T =ict then a 
Lorentz transformation is a linear transformation that leaves the quantity 
x2 + y? +z? + T? invariant. A proper Lorentz transformation may be 
regarded as a rotation in four dimensional (Minkowski) space-time and 
the ordered 4-tuple (x, y, z, T) isa vector the length of which remains 
unchanged under a Lorentz transformation. Clearly the ordinary rotations 
in (x, y, z) space are a special type of Lorentz transformation. 

Special relativity requires that the laws of nature take the same form 
in all inertial frames of reference. This can be expressed mathematically by 
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demanding covariance of the appropriate equations under Lorentz 
transformations. This means that all terms in a given equation transform 


in the same way under a transformation and so have the same tensor 
rank, 


Consider the Schrédinger equation for a free particle of mass m. 


The operators V? and 2/d¢ are invariant under a linear transformation 
of (x, y, z) alone and so this equation is covariant under a spatial rotation. | 
However it is clearly not covariant under a Lorentz transformation of the | 
type (12.4). It may involve second derivatives involving the time after the 
transformation is applied. This result is not surprising as the Schrédinger 
equation is obtained from a non-relativistic, classical Hamiltonian. 


12.3 The Klein-Gordon Equation 


A relativistically invariant quantum theory must be based on a relativistic 
Hamiltonian. 

Consider a free particle of rest mass 719 moving relative to an observer 
with speed v. The ordered 4-tuple 


iff 
Px, Py, Pz,— 
c 


where the first three entries are the momentum components, 


Px =BmoxX Py=Bmoy pz = Bz 


2 
pa ffi-3 


and # is the Hamiltonian, transforms as a vector in four-dimensional 
space-time and its length is invariant under a Lorentz transformation so 
that 


H? 
p2 +p? +p? — a= —mic?, (12.5) 
The relativistic Hamiltonian is given by 
H? =(P2 +P? +P2)c? + moc’. (12.6) 


The quantum mechanical Hamiltonian is obtained by the usual substitution 
for the linear momentum components. 


H? = — c*h?V? + mic*. (12.7) 
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The usual time-dependent Schrédinger equation (postulate 3, Chapter 3) 
is impossible to apply because of the square root in the Hamiltonian. 
However, since the operator # is equivalent to the operator 


ne 
then a suitable equation of motion is 
aw 
2 = 74? — 12.8 
HU a7? (12.8) 
i.e. 
1 02h] mac? 
V2w — ——_|=—_®& w. 29 
e “ n oe) 


Note that this equation is second order in time. It is well-known that the 
operator 


32 
or? 


is invariant under a Lorentz transformation and so equation (12.9) is 
Lorentz covariant as mo and c are constant scalar quantities. 

This equation was discovered by Schrédinger (1926) before he 
discovered the non-relativistic equation that bears his name. However 
as it did not give good agreement with the known hydrogen spectrum 
he discarded it. Klein and Gordon (1926) studied equation (12.9) and it 
carries their names. 


J 
e=Vy? _— 
Vv 3 


Equation of continuity 


In section 5.5 it was shown that in the non-relativistic theory, the 
probability density P and the probability current density S satisfy the 
equation of continuity at each point of space, i.e. 


= +divS=0 (12.10) 


where 


=r 


and 


ih 
$= [¥ grad Y* — &* grad ¥]. (12.11) 
2mo 
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In the same way by multiplying the Klein-Gordon equation (12.9) by 
W* and subtracting from this result its complex conjugate, the equation 
of continuity (12.10) is again obtained. However in this case the 
probability density is 


h awe aw 
P=—— ae 2.1 
2imogc? (v ar ) eee 


with the probability current density taking the Schrédinger form (12.11). 
The Klein-Gordon probability density (12.12) introduces a difficulty 
that does not arise in the Schrédinger case. The Schrédinger P (12.11) is 
nowhere negative as expected for a probability density. Unfortunately 
this is not the case for the case for the Klein-Gordon probability density 
(12.12). This expression can be positive, zero or negative. This arises 
because the Klein-Gordon equation, unlike the Schrédinger equation, is 
second order in time and the values of Y and 0%/d¢ at a given instant can 
be arbitrary. Even if these values were chosen to make P > 0 at all points 
of space at some initial time it would not follow that P would not 
become negative at a later time. It was some time before this curious 
result was interpreted. The quantitites eP and eS can be regarded as the 
electric charge density and current density respectively and it is reasonable 
tor both to be positive or negative if both signs for charge occur. Also, at 
high energies there is the possibility of creation and annihilation of particle 
pairs so that the number of particles is not conserved. The charge density 
eP depends on the relative number of positive and negative charges and 
so may be positive or negative. It can be shown from the equation of 
continuity that the total charge is conserved, i.e. 


Jer dr = constant. 


The integral is over all space co-ordinates. 

In explaining the change in sign of eP and eS it has been necessary to 
allow the presence of both positive and negative (and neutral) particles 
and this is essentially a many-particle theory. Also, it can be shown that 
in the non-relativistic limit of low energies the Klein-Gordon equation 
reduces to the Schrédinger equation without spin. The solutions of the 
Klein-Gordon equation describe particles with zero-spin. Pions are such 
particles. However the Klein-Gordon equation certainly cannot describe 
particles with spin 4 and so cannot describe the motion of electrons. 
Dirac discovered the correct equation for electrons. 


12.4 The Dirac Equation 


By an ingenious method Dirac in 1928 succeeded in obtaining a linear 
relativistic equation which is first order in time. For a free particle, the 
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relativistic Hamiltonian is given by 

H? = (p2 + p32 + p2)c? + moc*, (12.13) 
where p,, Py, Pz represent the usual quantum operators. 


The basic postulate of Dirac was to express the right side of (12.13) 
as a perfect square so that the relativistic quantum Hamiltonian is 


H = c(AxDx +aypy + a, Dz + Bmgc) (12.14) 


with a, dy, &z, 8 independent of position, momentum and time. The 
negative root could be taken for # but this choice does not lead to any 
extra solutions. From (12.13) and (12.14) it is clearly necessary for 


c7(Azpx + Aypy +a,pz + Bmoc)* = (p2 + p32 + p2)c? + mac’. 
It follows immediately that 

6? =a2 = a3 =a? = (12.15a) 
and that any pair of a, ay, a,, 6 anticommute, i.e. 


Oy Ay + Oya, =0 
a,8 + Ba, =O, etc. (12.15b) 


It is clear from (12.15) that the a’s and 8 cannot be simply numbers 
as they do not commute. They are in fact operators and can be 
represented by square matrices @,,@,,a@, and B. The Hamiltonian is 


h 
H=c 7 @ grad +Bmoc? (12.16) 


where 


_. @ t) 
@.grad =a, ax ria an 


The time-dependent equation (3.10) becomes 


(: a. sna +Am @= ih ~ (12.17) 
Equation (12.17), which is first order in both space and time co-ordinates, 
is relativistically covariant and is Dirac’s equation for a free particle. It 
must be understood that ® is a column matrix representing a many- 
component wave-function. 

As the Hamiltonian must be Hermitian then the # and B matrices 
must also be Hermitian. Hermitian matrices that satisfy the commutation 
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relations (12.15) can only be found if they have at least 4 rows and 
columns, One possible choice is to diagonalize B with 


10 O O 
0 1 0 0 
B=) 4 9 -1 0 0 a) 
00 oO -1 
and then | 
0001 
0010 0 o | 
*"1o 10.0 -(? *} 
1000 | 
0 QO -i 
sie 0 O07 O -(° 2) (12.18) 
0-i 0 0 sy 0 
i 0 0 0 
0 1 0 
— 0 O31 -(° ) 
: 1 0 0 s, 0 
OST 262 <6 


where G,, 6,, 0, are the Pauli spin matrices. This choice is certainly not 
unique. Any new set of matrices obtained by premultiplying the above 
by an arbitrary unitary matrix S and post multiplying by S~! will also 
satisfy the necessary conditions. The choice (12.18) is called the Dirac 
representation. 

These matrices operate on a four-dimensional ‘spinor space’ which is 
spanned by a set of four basis functions. The Dirac wave-function 
satisfying (12.17) is represented in this space by a column matrix. 


Wir, 2) 

W2(r, 1) 
P(r, t)= 3 2.19 
St \ i) de 


Wa(r, t) 
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The W;(r, ¢) are the components of the wave-function in the ‘spinor- 
space’ and the Dirac equation (12.17) is equivalent to four simultaneous 
first order equations that are linear and homogeneous in Y,, ¥2, 3 and 
W,. Each of these component wave-functions separately satisfy the 
Klein-Gordon equation (12.9). 

For stationary states, the time dependence can be separated from the 
spatial part by writing 


Br, 1) = (rye 2 (12.20) 


where @(r, t) and ¢(r) are both four-component column matrices and 
from (12.17) 


(- 7. grad +Bmoe!) 49 = E(t). (12.21) 


Probability and current density 


The Dirac equation, unlike the Klein-Gordon equation, leads to a 
satisfactory equation of continuity with a non-negative probability 
density. The time-dependent Dirac equation for a free particle is 


a 
ih + icha . grad ® —Bmgc? P= 0. (12.17) 


This is a matrix equation and its Hermitian conjugate is (take the 
complex conjugate and transpose and note (a, )* =a,, etc.). 


+ 


a@ 
— ih > — ich grad @* a — moc’@'B= 0. (12.22) 


®* is the row matrix Hermitian conjugate to ®, i.e. 
(1, 2) = UFC, 1), FIG, 0, VEG, 1), VEG, 9). 


Observe that in obtaining (12.22), the order of matrices in the second 
and third terms of (12.17) has been reversed. 

The equation of continuity is obtained in the usual way. Premultiply 
(12.17) by St and postmultiply (12.22) by @ and subtract to obtain 


és 
ih (= = + = e| + ich(@*a. grad @+ grad B* aH) = 0 


3 
a: (®*®) +c div (Sad =0. _. (42.23) 
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This is the equation of continuity with probability density 


p= b= |v, 7 + |v, |? + [W5l? + 1v4/? (12.24) 
and probability current density 
S=cB'ad. (12.25) 


Equation (12.24) ensures that the probability density is nowhere 
negative and is the expected result. 

The expression representing the probability current density is very 
interesting as the eigenvalues of a are +1. The operator representing the 
velocity components of the particle is 


f=o [7,1] =ca (12.26) 


and so the eigenvalues of f are tc. An exact measurement of a velocity 
component must give tc. This represents a ‘shuddering’ of the particle 
with speed c. Trigg (Quantum Mechanics), examines this result in more 
detail and suggests that it is meaningless to specify the position of the 
particle to an accuracy of less than its Compton wave-length h/moc. In 
this case the uncertainty principle would require that the uncertainty 
in the energy should be of the order of 2mpoc? and this would be 
sufficient to produce a pair of particles (see last section). 

The equation of continuity (12.23), the probability density (12.24) 
and the probability current density (12.25) remain valid in the presence 
of an electromagnetic field. 


Covariance of the Dirac equation 


The Dirac equation can be shown to be covariant under a Lorentz 
transformation. The reader is referred to Group Theory and Quantum 
Mechanics, by Heine, and Quantum Mechanics, by Davydov, for a 
discussion. 


Plane wave solutions 


It is expected that the free-particle Dirac equation (12.17) will have 
plane wave solutions of the type 


W;(r, t) =a; exp i(k.: — Et/h) j=1,2,3,4 (12.27) 
with 
k.r=xk, + yky + 2k, (12.28) 
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and a; are constants. Substitution of (12.27) into (12.17) gives the four 
simultaneous equations 


(moc? — E)a, + chk,a3 + ch(kx — iky)a4 =0 
(moc? — E)az + ch(k, + iky)a3 — chkz,a4=0 
chk,a, + ch(ky — iky)az — (moc? +E)a3 =0 
ch(ky +iky)a, — chkza2 ~ (moc? + E)ag = 0. (12.29) 


For a non-trivial solution, the determinant of the coefficients of the 
a; must vanish, i.e. 


(E? — m3c* ~ c?h?k?)? =0 (12.30) 
where 

k? =k2 +k2 +k. (12.31) 
The energy can take the two values (both double roots) 

Ey = +(m2c* + c2n2k?)!? 

E_ =—(mgct + c?An?k?)? (12.32) 
There are no allowed energy states between + moc?. 


There are two linearly independent solutions corresponding to the 
positive energy E,. In a non-normalized form, they can be chosen to be 


chk, ch(k,, +iky) 
a,=1 =0 = —__4+— = 
; "3 “a moc? +E si moc? + Ey 
(12.33) 
and 
ch(k, — iky) —chk, 
a, = 0 = 1 =o = 
1 a2 a3 muerte a4 moc? +Es 
(12.34) 
Similarly, for the negative energy _ , there are two linearly 
independent solutions 
___achk, _ —ch(ky + iky) _ . 
Qa Moc? _ E_ a2 = moc? _E_ a3 1 a4 = 
_ —ch(k,, — iky) ____ chk, 
a oe aE a2 = ra er a3z=0 a4=1 
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In the non-relativistic limit E, ~ moc? and the positive energy solutions 
(12.33) have two ‘large’ components a), a2 and two ‘small’ components 
of order v/2c. The negative energy solutions then have two ‘small’ 
components a}, a2 and two ‘large’ components @3, aq. 


Continuous 
energy 
spectrum 


+myc? 


Gap 


Continuous 
energy 
spectrum 


Figure 12.2 Continuous energy spectrum. 


The energy gap between + mgc? occurs in ‘classical’ relativistic theory 
(12.6). However these negative energies can be ignored in classical theory 
as energies can only change continuously and a particle whose energy is 
positive at some instant will remain so. This is not the case in quantum 
theory as the particle energy can change discontinuously. The relation 
between positive and negative energies will be discussed later. 


Introduction of spin 


For a free particle it is expected that angular momentum will be con- 
served. However it is simple to show that orbital angular momentum is 
not a constant of the motion. 

In the matrix representation the operator representing the z- 
component of orbital angular momentum is 


hi-3. 3 
pretties 5 Be 12.36 
ae rx) 1226) 
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where I is the 4 x 4 unit matrix. The free particle Dirac Hamiltonian is 
ten 2 
H= c 7a. grad +Bmoc* |. (12.37) 


The second term in the Hamiltonian is diagonal and so commutes with 
£,. It is simple to show that the commutator 


3 
1 L,] =[e4a. ona 4622) 


oy ax 
a 3 
=— 2 — at 
h o(« By a, 2) 
=~ hc(aA grad),. (12.38) 


As &, and # do not commute then the z-component of orbital 
angular momentum is not conserved. The generalization of (12.38) to 
include the two other components gives 


[96 £)| =-h*c(aA grad). (12.39) 
Now consider the matrix operator 
1 00 0 
i 0 -1 0 0 o, 0 
o, = = : (12.40) 
0 oO 1 0 0 «a, 
0 OO -!1 
It is not difficult to show that the commutator 
h, 
Faw =+h*c(aA grad),. (12.41) 
By defining 
‘ so, 0 0 
o, = ( = a, = he (12.42) 
0 «4, 0 »s 
it can similarly be shown that 
h , 2 
H, 5 =+h*c(aA grad) (12.43) 


where o’ has the three components o,, oy and 6. 
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By combining (12.38) and (12.43) then 


|x. + 2 ‘| =0 (12.44) 


and 
L+ a a’ 
2 


is a constant of the motion. 
By analogy with the Pauli spin theory a spin operator S' is defined by 


S' = ; yf (12.45) 


and the total angular momentum J is defined by 
J=L+S'. (12.46) 


The components of both S’ and F satisfy the necessary commutation 
relations for angular momentum operators (10.4). From (12.44), total | 


wa 


angular momentum is conserved for a free particle. 

The operators o;, 3,, a}, have eigenvalues +1 corresponding to 
eigenvalues + /2 for the spin operators S,,, Sj, S;. The Dirac equations 
describes particles whose spin along any specified direction can only take 
the two values + #/2. An electron is such a particle with spin 3. (Other 
such particles are protons, neutrons, etc.) 

Consider the plane wave energy functions obtained in the previous 
section. In the non-relativistic limit, putting the small components equal 
to zero, the column vectors representing the two solutions with energy 
£4 are, from (12.33) 


1 (0) 
0 1 
and 
0 0 
0 0 


These are simultaneously eigenvectors of S; with eigenvalues +//2 and 
—h/2 respectively. Of course these vectors are not eigenvectors of S,, or 
Sy as these operators do not commute with S;. 

Similarly, the ‘non-relativistic’ eigenvectors with energy E_ are, from 
(12.35) 


and 


> 
- OO O&O O&O 
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and are simultaneously eigenvectors of S, with eigenvalues + #/2 and 
~—h/2 respectively. 


12.5 The Dirac Particle in an E.M. Field 


An electromagnetic field is described completely when the vector 
potential A and the scalar potential 9 are given. It can be shown 
(Classical Electricity and Magnetism, by Panofsky and Phillips), that the 
ordered 4-tuple 


ip 
[4 Ay,Az, c 
| is a vector in Minkowski space and its length is invariant under a Lorentz 
transformation. 
The momentum p,. conjugate to x for a particle (electron) with charge 
—e in an electromagnetic field is given by ((1.43) holds in relativistic 
mechanics) 


Py =Bimox — eA, (12.47) 


and similarly for the other two components. The ‘classical’ relativistic 
Hamiltonian for the particle in the field is obtained from the free particle 
Hamiltonian in the usual way by substituting for the linear momentum 
in terms of the new conjugate momentum (12.47) and also replacing the 
Hamiltonian by H + e¢. From (12.6) 


(H + ed)? =c? [(py + eAx)” +(py + eAy)? + (pz + eAz)"] + moc*. 


(12.48) 
A quantum-mechanical wave equation is obtained by the substitutions 
a 
H> ih— 
at 
and 
ad ud a i.e 
mer 
2 2 2 
ha 
ae a2 . ane 
(w +e) W=e (2 Ox te ) +( 3 +eAy) 
hd ates 
+ 7 ae +eA,} +mgc*|¥. (12.49) 


This relativistically covariant equation is the Klein-Gordon equation for 
a particle of charge —e in an electromagnetic field and will not be 
considered further. 
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The Dirac equation for a particle of charge —e in the field is obtained 
in a similar manner. From (12.17) 


c) h 
JZ 106 —ca, (Pena +ea) -pme']@=0. (12.50) 
This equation is also relativistically covariant but its solutions are not 


solutions of the Klein~Gordon equation unlike the field-free case. To see 
this operate on (12.50) using 


) h 
E ar tlegtca. (} grad + cA sBmee’ 
i 


to obtain a similar but different equation to (12.49). 
To find the stationary states of the Dirac equation put 


PE, =o” 
in (12.50) and then 
[iste ca-(# ea rea] -2moc*| 0). (12.51) 


It has already been demonstrated that in the non-relativistic limit the 
four-component Dirac wave-function can be divided into large and small 
components. With this limiting case in mind put 


_[% 
(1) = (%] 


where X, and X, are two-component column vectors. With this substituion 
equation (12.51) becomes 


h 
I[E +e¢ ~ moc”]x, -—co. (a6 ea)n =0 


h 
I[E +e¢+moc*]x, —co. (2 grad rea] X=9 (12.52) 
with 


h h a ha 
(2 grad +ea| = 0, (22 +04, + Oy (42 +24,) 
ha 
+0,(|-—-+ ; 
o(! as cAs] 


Consider a positive energy solution with 


E=moc? +E’. 
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Eliminating x, from (12.52) 


h 2 
|. (Pera + ea) 
F i 


—I =12'y:. : 


¢c 


In the non-relativistic limit of small kinetic energy and if the field is 
weak then E’ < moc? and e¢ < moc? and so 


1 1 f omat | 


(2moc? +E' + ed) “Qmoc2 | 2moc? 


Keeping the first term only of this binomial expansion (12.53) becomes 


1 h : ‘ 
c. > grad +eA]|] —le@}xX,=1F XY. (12.54) 


2g 


This is a two-component equation and gives the large components X;. 

In this non-relativistic limit the magnitude of the momentuni is given 
by mV where V is the electron speed and the second of the equations 
(12.52) becomes 


wil 
Xs 2M 


This result confirms that x, is ‘small’ compared with X,. As the operator 


h 
(? grad + ea) 


commutes with the components of o then it can be shown that 
(problem 6) 


h ‘ A 
le; grad rea) =1( Pana +e] via | (Pena rea] 
I 


h h : 
A (? grad + ea]| =I (2 grad + ea) +efio.curlA. (12.55) 


The magnetic flux 
B=curlA 


and so in the non-relativistic limit, the two large components of the 
Dirac equation satisfy 


I h 2 +h 
Paras Nurs + mn Le es t 
Ls (ene ea] sad t46 aly IE. (12.56) 
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This equation was suggested by Pauli for the electron. It is of course 
simply the equation obtained by combining the Schrodinger non- 
relativistic theory and the Pauli electron spin theory of Chapter 10. 

(c.f. (8.24), (10.24), (10.25)). The Pauli theory is obtained in the non- 
relativistic limit (i.e. small velocity) from the Dirac theory. The final term 
on the left hand side of (12.56) implies that the particle has an intrinsic 
magnetic moment of 


in agreement with the value assumed in Chapter 10 for the electron. 
Clearly the particles described by the Dirac equation are electrons. 


Coulomb field 
A Coulomb field due to a charge of +Ze can be described by 
+Ze 
A=0 = 
4néor 


The Dirac Hamiltonian is then 
h —Ze? 
H=c-a. + 2 

c ; a. grad +Brgc Anesr 
The solutions of the corresponding equation is dealt with in detail by 
Davydov (Quantum Mechanics, p. 266). It is sufficient here to point out 
that the total angular momentum as defined by (12.46) is conserved for 
a particle in a central field. The reader is asked to verify this himself. 


12.6 The Positron 


It has been shown that the Dirac equation has both positive and negative 
energy states and this leads to a difficulty in interpretation. These nega- 
tive energy states correspond to the negative square root of the classical 
‘relativistic’ expression (12.6). However, in classical theory a particle 
energy changes continuously and so a particle in a state with positive 
energy will remain so and the negative energy states can be ignored. This 


is not the case in quantum mechanics as a particle can make discontinuous 


jumps in energy and so cross the ‘forbidden’ gap 2mgc?. There are an 
infinite number of negative energy states. In this case it would seem 
reasonable to expect a Dirac particle to fall to lower energy states 
radiating continuously. However electrons are not observed to do this 
and some other interpretation of the negative energy states must be 
forthcoming. 
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In 1930 Dirac proposed that in the normal state the negative energy 
states are filled and assuming the particles obey the Pauli principle a 
particle with positive energy cannot jump to a state with negative 
energy. The charge density due to the filled negative energy states must 
be assumed to be unobservable. A vacuum is then a space in which all 
negative energy states are filled and all positive energy states are empty. 
Only deviations from the vacuum state are observed. 

A positive energy solution of the Dirac equation (12.50) corresponds 
to an ordinary negatively charged particle (electron). A negative energy 
state is only observed when it is empty. Now a filled negative energy 
state corresponds to a particle with both negative charge and negative 
mass. Consequently the observed empty negative energy state appears as 
a particle with equal positive charge and positive mass. This ‘anti-particle’ 
appears as a hole in the ‘sea’ of filled negative energy states. The energy 
of the anti-particle is positive as it corresponds to the energy required to 
lift a negatively charged particle out of the negative energy state. 

It is quite easy to relate the behaviour of an electron to its anti- 
particle. The Dirac equation for an electron of charge —e and rest mass 
Mg in an electromagnetic field is 


a h 
[ate 166 ca.(? gaa ven] -Bmact| @=0. (12.57) 


It is useful to choose a representation in which the & matrices are all 
real and Bis imaginary. A choice satisfying the conditions (12.15) is to 
take @,, and a, as before (12.18) and interchange @, and Bto obtain 


10 0 0 0 Oo -i 
pe 0 1 0 0 p= 0 Of 

00 -!l 0 0 -i 0 0 

00 0 -!1 i 00 0 


(12.58) 


With this choice take the complex conjugate of equation (12.57). 
ny 2 h 2 
— | il a +I(—e)¢-ca. 7 oad +(—e)A} —Bmgc* | ®* = 0. 
(12.59) 


If in (12.57) @ describes an electron with charge —e, then ®* 
describes a particle with equal mass but opposite charge +e. 
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Consider now the stationary states. The Dirac equation for an 
electron with energy E£ is 


E +led—ca. (" grad +ea] -Amoe*| P=0. (12.60) 


Take the complex conjugate of this equation. 


- foo +I(—e)d— ca. (2 grad + cea] -pmec*| @* =0,. 
(12.61) 


If in (12.60) ® describes an electron with negative energy (F <0) 
then @* describes a positively charged particle with positive energy. ; 
The electron and its antiparticle appear symmetrically in the theory. The 
pair of equations (12.60) and (12.61) should be considered together and 
the negative energy solutions of one correspond to the positive energy 
solutions of the other. 
From this discussion it is seen that the electron and its antiparticle 
possess the same mass, spin and differ only in the sign of their charge. 
The electron antiparticle was discovered in 1932 and is called the 
positron. | 
It is observed that a positron always appears as a pair together with 
an electron and that an energy greater than 2mmgc? is absorbed in the 
pair creation. This is explained as in pair creation an electron must be 
excited from a negative energy state through the forbidden gap of 
2moc? to a positive energy state. The empty ‘hole’ manifests itself as 
the positron and the electron in the positive energy state behaves as an 
ordinary electron. 
The reverse process takes place when an electron falls into an 
unoccupied negative energy state. This occurs as electron-positron 
annihilation and the energy is emitted as photons. 
To explain the negative energy state it is necessary to extend the 
single particle Dirac theory to a type of many-particle theory. This is not 
very satisfactory. The difficulties can be overcome using the method of 
quantized fields but this development is outside the scope of this text. 
The second quantization formalism mentioned in the previous chapter is 
necessary to cope with the annihilation and creation of particles. 


PROBLEMS 


1 Equation (12.9) is the Klein-Gordon equation for a free particle. 
Consider the transformation 


W(r, t) = Yo(r, 2) exp (—imoc?t/n). 
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In the non-relativistic limit E = E' + mgc? with E' < mgc?. Also in 
general 


Hence show that 
av _ —imoc? 


ar r Wo exp (—imgc?t/h) 


and obtain a similar expression for 02/dr?. 
Show further that in this approximation of E’<mgc? the Klein- 
Gordon equation becomes 
hn? _, IV 
——— WV, = —— 
rig Oo aE 


This is Schrédinger’s non-relativistic equation for a free particle. 


2 Using the same transformation as in the above question and in the 
same non-relativistic limit, show that the Klein-Gordon probability 
density (12.12) reduces to the Schrédinger probability density (12.11). 


3 Show that conditions (12.15) are necessary for the square of (12.14) 
to be equal to (12.13). 


4 Show that the Dirac matrices @,, ay, & and Bare both Hermitian 
and unitary. Show that all four matrices have eigenvalues of +1 only. 


5 Confirm that the total angular momentum operator (12.46) commutes 
with the Hamiltonian for a Dirac particle in a central field. 


6 If Aand B are two vector matrix operators that commute with the 
Pauli spin matrices show that 


(s. A)(o.B)=(A.B) +i[o.(AAB)] 


where o. A=0,A, +a,Ay +0,Az;, etc. Hence prove equation (12.55). 


References 


Davydov, A. S. Quantum Mechanics. Pergamon, Oxford (1965). 

Dirac, P. A. M. Proc. Roy. Soc. A117, 610 (1928). 

Dirac, P. A.M. Quantum Mechanics. O.U.P., Oxford (1947). 

Gordon, W. Z. Phys. 40, 117 (1926). 

Heine, V. Group Theory in Quantum Mechanics. Pergamon, Oxford 
(1960). 

Klein, O. Z. Phys. 37, 895 (1926). 


330 A FIRST COURSE IN QUANTUM MECHANICS 


Panofsky, W. K. H. and Phillips, Classical Electricity and Magnetism, 
Addison-Wesley, Cambridge, Mass. (1955). 

Schrédinger, E. Ann. d. Physik 81, 109 (1926). 

Trigg. G. L. Quantum Mechanics. Van Nostrand, London (1964). 


Suggested Further Reading 


Bethe, H. A. Intermediate Quantum Mechanics. Benjamin, New York 
(1964). 

Gottfried, K. Quantum Mechanics, Vol. 1. Benjamin, New York (1966). 

Mackey, G. W. The Mathematical Foundations of Quantum Mechanics. 
Benjamin, New York (1963). 

Merzbacher, E. Quantum Mechanics. John Wiley, New York (1961). 

Messiah, A. Quantum Mechanics, Vols. I and II. North-Holland, 
Amsterdam (1961; 1963). 

Trigg, G. L. Quantum Mechanics. Van Nostrand, Princeton (1964). 


Index 


Abelian group, 218, 228 
Absorption of radiation; 30, 93, 
193, 278 
Accidental degeneracy, 221 
Addition, angular momentun, 244, 
260 
Alkali atom spectra, 247, 292 
Allowed transition, 92, 196, 290 
Alpha-particle decay, 34, 82 
Alternating intensities in band 
spectra, 278 
Angular momentum, 42, 105, 258. 
addition, 244 
commutation relations, 106, 
223, 238, 259, 265, 267 
eigenvalues, 108, 225 
matrices, 141 
of systems, 258 
orbital, 225, 258 
spin, 237, 265 
total, 242, 267 
Anti-particle, 327 
Antisymmetric wave-functions, 268, 
270, 282 
Approximate methods, 152, 186 
Associated Legendre functions, 90, 
278 
Atomic terms, 271,286 . 
Average values, 64 
Axial rotation group, 218 
Azimuthal quantum number, 32, 
95, 247 


Band structure, 93, 277 
Basis vectors, 63, 214 
Bethe, H. A., 250 


Binding energy, 157, 275, 277 
Black-body, 21 
Bloch function, 229 
Bohr, N., 29 

atom, 29. 
Bohr-Sommerfeld conditions, 31, 

179 

Bond, 275 
Born, M, 46, 277 
Born-Oppenheimer, 277 
Born approximation, 200 
Bosons, 268, 301 


Calculus of variations, 12 
Canonical form of equations of 
motion, 7 
Cayley-Hamilton theorem, 147 
Central field approximation, 282, 
284 
Centre of mass, co-ordinates, 88 
co-ordinate system, 198 
Characteristic, function (see Eigen 
function) 
value (see Eigen value) 
Characters, 217, 230 
Chemical bond, 275 
Classes, 217, 218, 228 
Classical mechanics, 1 
Clebsch~Gordon coefficients, 261 
Collisions, 198 
Compatible observables, 101 
Complete sets, of functions, 64, 65 
of operators, 104, 260 
Compound pendulum, 4 
Compton effect, 35 


332 


Configuration, 285 
Conjugate momentum, 6, 141 
Conservation of energy, 9, 110 
Constants of motion, 10, 108 
Continuous spectrum, 29, 66 
Co-ordinates, generalized, 1 
rotations, 207 
Correspondence principle, 33 
Coulomb field, 30, 93, 326 
Covariance of the Dirac equation, 
318 
Creation operator, 295, 301 
Crystal field, 229 


D® 227, 263, 287, 291 
de Broglie, L., 37 
Debye, 28 
Degeneracy, 32, 221 
and perturbation theory, 170, 
229 
orbital, 95 
spin, 245 
Degrees of freedom, 1 
Delta function, 67, 190 
Density matrix, 145 
Determinantal wavefunction, 270, 
271, 276, 282, 294 
Diatomic molecule, 8, 277 
Dimension of vector space, 63 
Dirac, P. A. M., 43, 49, 314 
coulomb fieid, 326 
e.m. field, 323 
equation, 310 
plane wave solutions, 318 
D-\ines, sodium, 250, 292 
Dual nature of matter, 37 
Dynamical observable, 54, 56, 101 


Ehrenfest’s theorem, 116 
Eigenfunction, 45, 59, 221 
angular momentum, 108, 
225, 259, 261 
completeness of, 64 
degenerate, 108, 221 
energy, 61, 221 
spin angular momentum, 239, 
266 
Eigenvalue, 46, 59, 221 
degenerate, 64, 221 
of matrix, 132 
reality of, 60, 134 


INDEX 


Eikonal equation, 75 
Einstein, A., 24, 26 
Elastic collisions, 197 
Electric-dipole, approximation, 
195, 290 
selection rules, 196, 290 
Electromagnetic field, 11, 190, 204, 
323 
Electron field operators, 301 
Electron in an e.m. field, 11, 69, 
190, 323 
Electron spin, 236, 320 
resonance, 246 
Elements of a group, 210 
Energy level, 45 
breadth of, 197 
Coulomb field, 95 
harmonic oscillator, 84 
lowest, and uncertainty 
principle, 114 
rigid rotator, 90 
Equations of motion, in Hamiltonian 
form, 7 
in Lagrangian form, 4 
Equivalence of representations, 
139, 214 
Euler-Lagrange equation, 13 
Exchange energy, 271, 283, 287 
Excited states of helium atom, 272 
Exclusion principle, 49, 269, 284 
and periodic system, 284 
Expectation value, 64 


Fermat’s principle, 38, 75 

Fermi particles, 237, 268, 285, 294 

Fermi’s golden rule, 190 

Field, self-consistent, 279, 282 

Fine structure, 91, 96, 288 

Forebidden transitions, 196, 290 
first-order, 196 
strictly, 196 

Free electron model, 85 

Full rotational group, 222 
irreducible representations, 

225 
Function space, 63, 103, 213, 259, 
261 


g-factor, 250 
Generalized co-ordinates, 1 
Generalized momentum, 6 


INDEX 


Goldstein, H., 1, 19 

Good quantum number, 110, 259, 
263, 286, 289 

Goudsmit, S., 236 

Ground state of helium, 155, 183, 
271 

Group, definition of, 210 

Group velocity, 40, 116 

Gyromagnetic ratio, 250 


Half-integral quantum numbers, 34, 
180, 243 
Hamiltonian, classical, 6 
of free atom, 93 
operator, 48 
Hamilton-Jacobi equation, 14, 74, 
75 
Hamilton’s equations, 5 
characteristic function, 14 
Harmonic oscillator, 22, 32, 83 
classical limit, 85 
Hartree, D. R., 279 
self consistent field, 279 
~Fock self consistent field, 
282 
Heisenberg, W., 42 
picture, 143 
uncertainty principle, 48, 85, 
112, 197 
Heitler-London wave function, 276 
Helium atom, 155, 271 
excited state, 272 
ground state, 155, 183, 271 
Hermitian, matrix, 132, 315 
operator, 56 
Hertz, H. R., 25 
effect, 25 
Hilbert space, 65 
Holonomic system, 1 
Hund’s rules, 288 
Hydrogen, atom, 93 
molecular ion, 159, 275 
molecule, 274 


Identical particles, 267 
Identity, operation, 211 
representation, 217 
Impact parameter, 198 
Improper rotations, 209 
Independent particles, 269 
Indistinguishable particles, 268 


333 


Induced emission, 194, 195, 290 
Infinitesimal rotation operators, 
222, 259, 265, 267 
commutation relations, 223 
and angular momentum, 223 
and spin, 238, 265 
total, 243, 267 
Infrared spectra, 91, 278 
Inversion operator, 263 
and parity, 263 
Irreducible representations, 214, 
221 


J-J coupling, 290 


Kemble, E. C., 95, 178 
Klein-Gordon equation, 312, 323 
Kronecker delta, 10 


Laboratory co-ordinate system, 198 
Ladder operator, 225 
Lagrange’s equations, 4 
Lagrangian, 4 
charged particle, 11 
Landé g-factor, 250 
Laporte’s rule, 290 
Line breadth, 197 
Linear combinations, 65, 157 
Linear independence, 62, 214 
Linear momentum, 62, 222 
Lorentz, group, 310 
transformation, 310 


Magnetic field, Hamiltonian, 173 
Zeeman effect, 173, 229 
Magnetic moment, 120 
electron spin, 245, 326 
orbital, 120 
Magnetic quantum number, 32, 94, 
174 
Matrices, 43 
eigenvalues, 132 
Hermitian, 132 
spin, 241, 316 
unitary, 132 
Mixed state, 145 
Molecule, hydrogen, 274 
Momentum, generalized, 6 
orbital, 222, 258 
representation, 58 
spin, 238, 265 


334 BeatriceGloria_personal library 


Non-interacting particles, 269 
Non-relativistic limit, 320, 326 
Nuclear shell model, 286 
Number operator, 296 


Observable, 54, 56, 101 
Olid quantum, theory, 23 


One-dimensional problems, 45, 77, 


228 
Operator, 55, 209 


angular momentum, 69, 223, 


258 
Hermitian, 56 
parity, 263 
spin, 238, 265 
Oppenheimer, J. R., 277 
Orbit penetration, 247 
Orbital, angular momentum, 225, 
258 
atomic, 95, 284 
molecular, 275 
Orthogonal functions, 59, 60, 62 
Ortho-hydrogen, 278 
Orthonormal, basis, 63 
functions, 60, 63 
Oscillator, harmonic, 22, 32, 83 


Para-hydrogen, 278 
Paramagnetic ions, 229 
Parity, 263 
Particle in a box, 45, 85, 220 
Paschen-Back effect, 253 
Pauli, spin matrices, 241, 316, 325 
exclusion principle, 269 
Periodic system, 284 
Perturbation, of degenerate levels, 
170 
time-dependent, 186 
time-independent, 163 
splitting of levels, 173, 229 
Phase velocity, 38, 40 
Photo-electric effect, 24 
Planck, M., 23 
constant, 23, 57 
Poisson brackets, 9, 57, 108 
Positron, 326 
Probability current density, 118, 
313 
Dirac, 317 
Klein-Gordon, 314 


INDEX 


ea density, 46, 64, 313, 
317 

Product, space, 259 
representation, 262 

Proton spin, 237, 277 

Pure state, 145 


Quantum numbers, 23, 32, 59 
atomic n, l, m, 94 
atomic, j, 244 
L, S, J, 259, 261, 266, 267 


Radiation absorption, 30, 93, 193, 
278 

Rare gases, 285 

Rayleigh-Jeans law, 22 

Reduced mass, 89, 94, 191, 250 

Reduction of a representation, 215 

Relativistic wave equation, 49, 313, 
315 

Representative ensemble, 145 

Rigid rotator, 8, 88 

Ritz combination principle, 29 

Rotational spectra, 91, 277 

Russel-Saunder’s coupling, 289 

Rutherford atom, 29 


Scattering, angle, 198 
cross-section, 198 
Schrédinger, E., 43 
representation, 57 
time-dependent equation, 46, 
59, 74 
time-independent equation, 
45, 61,77 
Schwarz inequality, 111 
Scleronomic system, 2 
Second quantization, 293 
Secular equation, 159 
Selection rules, 92, 196, 290 
Self-consistent field, 279, 282 
Semi-classical treatment of radiation, 
190 
Simultaneous eigenfunctions, 101 
Single-valued wave-functions, 55, 
90, 94, 239 
Singlet states, 266, 276 
Sommerfeld, 31, 42, 179 
Specific heat, 26 
Spin, electron, 49 
magnetic moment, 245, 326 


INDEX 335 


nuclear, 247, 277 Turning point in W. K. B. approxi- 
orbit coupling, 247, 288 mation, 176 
spectroscopy, 246 
Spinor, 239 Uhlenbeck, G. E., 236 
Spontaneous emission, 196 Ultra-violet catastrophe, 23 
Stark effect, 169 Uncertainty principle, 48, 85, 112, 197 
State function, 55 Unit matrix, 130 


Stationary state, 62, 110 
Stefan’s law, 21 

Stern-Gerlach experiment, 236 
Symmetric wave function, 268, 


Van der Waals interaction, 167 
Variation of constants, 186 
Variational method, 152 


272, 276, 277 Virial theorem, 154 

Wave equation, 59, 61, 315 
Terms, 271, 286 Wave function, 45 
Thomas factor, 248 for helium, 156, 271 
Time-dependent perturbation, 186 for hydrogen atom, 93, 95 
Time-dependent perturbation, for hydrogen molecule, 274 

for hydrogen molecular-i 

Total angular momentum, 242, 267 160 8 oreeular ion, 
Trace, 131, 150, 214 Wave packet, 41, 114 
Transformation, 137, 207 W. K. B. approximation, 174 


Transition probability, 189, 195, Wien displacement law, 22 
231, 290 , 


Transpose matrix, 129 Zeeman effect, 173 
Trial wave-function, 153 strong-field, 253 
Triplet states, 266, 273, 277 weak-field, 173 


Tunneling, 79 Zero point energy, 85, 180 


Pe 
' 


OEE pei a hitb dp 
Pett Spree ee Pes 
Me ig etre 

oY ak 


- r 7 
rq ete ~tte rer 
J ey F dy o es 
- id : ” “gr 
es he & . me ; - 


Ut ee 


